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INTRODUCTION 


This volume grew out of the ‘UF GALOIS THEORY WEER’, a confer- 
ence held at the University of Florida, Oct. 14-18, 1996. The conference 
was dedicated to the Inverse Galois Problem. The richness of this area 
stems from the fact that it attracts people from all kind of mathematical 
backgrounds, with methods ranging from explicit polynomial calculations 
and even numerical computer calculation to the structure and character 
theory of finite simple groups and up to the most abstract methods of al- 
gebraic/arithmetic geometry (like moduli stacks). In the original spirit of 
Galois, all these turn out to be just different aspects of the same matter. 

Here is a brief description of the contents of this volume. Abhyankar 
continues his work on explicit classes of polynomials in characteristic p > 0 
whose Galois groups comprise entire families of Lie type groups in char- 
acteristic p. In characteristic > 0, such polynomials have so far only been 
found for finitely many non-abelian simple groups except alternating groups. 
New methods for this, involving the determination of explicit equations for 
certain Hurwitz spaces, are developed in Couveignes’ paper. 

There is a way of realizing infinite series of Lie type groups as Galois 
groups over the rationals, based on Riemann’s existence theorem and the 
notion of rigidity. It does not directly yield polynomials having these groups 
as Galois groups, however. New results in that direction are obtained in the 
paper of Thompson and Volklein, using a certain generalization of rigidity 
called the braid-abelian property and resulting in Galois realizations of the 
projective symplectic groups PSp(n,q) under certain restrictions on n and 
q. 

The more abstract aspects come into play when considering the totality 
of Galois extensions — with certain properties of a given field. This 
amounts to the study of certain (profinite) fundamental groups and abso- 
lute Galois groups. Harbater studies the fundamental group of an affine 
curve in positive characteristic. He determined the finite quotients of this 
fundamental group in his proof of Abhyankar’s conjecture, but its full struc- 
ture as profinite group remains mysterious. Ihara investigates the relation- 
ship between the absolute Galois group of the rationals and the so-called 
Grothendieck-Teichmiiller group. The former is an arithmetic object. con- 
trolling the totality of Galois extensions of the rational field, whereas the 
latter is a geometric object (a certain subgroup of the automorphism group 
of the geometric fundamental group of the 3-punctured line). Ihara’s paper 
provides evidence that these two groups may not be as closely related as 
was previously hoped for. Nakamura’s article surveys related material. 

Wewers’ paper attempts to provide a bridge between the abstract and 
concrete approaches, by supplying a more elementary proof of Grothendieck’s 
results on the relationship between the fundamental group of the punctured 
line in characteristic 0 and in positive characteristic. Debes studies general 
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questions concerning the field of definition of a cover and models of the 
cover over such a field. 

The articles of Frey/Kani/Volklein and of Muller apply methods devel- 
oped for the Inverse Galois Problem to other areas. The former constructs 
infinite towers of unramified Galois curve covers defined over a fixed number 
field or finite field, with the additional property that there is a compati- 
ble system of rational points on these curves. In particular, this provides 
families of curves with ‘many’ rational points. Finally, Miller’s paper in- 
vestigates arithmetic properties of rational functions related to Hilbert’s 
irreducibility theorem. 


GALOIS THEORY OF SEMILINEAR TRANSFORMATIONS* 
By 
Shreeram S. Abhyankar 
Mathematics Department, Purdue University, West Lafayette, IN 47907, USA; 
e-mail: ram@cs.purdue.edu 


Abstract. The general linear groups GL(m,q) can be realized as Galois 
groups of certain vectorial (= g-additive) polynomials over rational function 
fields when the ground field contains GF(q¢), where m > 0 is any integer and 
q > 1 is any power of any prime p. When calculated over the prime field as 
the ground field, these Galois groups get enlarged into the semilinear groups 
[L(m,q). Similarly, for any integer n > 0, the Galois groups of the n-th it- 
erates of these vectorials get enlarged from GL(m, q,n) to TL(m, g,n) where 
GL(m,q,7) is the general linear group of the free module of rank m over the 
local ring GF(q)[T]/7” and FL(m, q,n) is its semilinearization. Likewise, a 
corresponding enlargement to the semilinear symplectic groups ['Sp(2m, q) 
happens when dealing with suitable vectorials having the symplectic simil- 
itude groups GSp(2m,q) as Galois groups. Much of this continues to hold 
when, instead of over rational function fields, the vectorials are considered 
over meromorphic function fields. A similar semilinear enlargement takes 
place when dealing with Galois groups between SL(m,q) and GL(m,q) or 
between Sp(2m,q) and GSp(2m,q). The calculation of these various Galois 
groups leads to a determination of the algebraic closures of the ground fields 
in the splitting fields of the corresponding vectorial polynomials. 


Section 1: Introduction 
Throughout this paper, let kp C K C Q be fields of characteristic p > 0 
where 22 is an algebraic closure of K, let ¢ = p“ > 1 be any power of p, let 
m > 0 be any integer, and to abbreviate frequently occurring expressions, 
for every integer i > —1, let us put 


(i) =1l+q+q°+---+¢' (convention: (0) = 1 and (-1) = 0). 
Moreover, for any nonconstant @ = (Y) € K[Y] we let 
SF(¢, A) = the splitting field of @ over AK in Q 


and 
AC(kp,¢@, K) = the algebraic closure of k, in SF(¢, KX). 


For various classes of separable ¢, we shall determine the group Gal(¢, K) 
and the field AC(kp,¢,K). Here K will mostly be a rational function field 
over k, or a formal meromorphic series field over ky. Also ¢ will mostly be 
a projective or subvectorial or vectorial polynomial over I. 


*1991 Mathematical Subject Classification: 12F10, 14H30, 20D06, 20E22. This work 
was partly supported by NSF Grant DMS 91-01424 and NSA grant MDA 904-97-1-0010. 
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2 ABHYANNAR: Galois theory of semilinear transformations 


Recall that f*(Y) (resp: ¢*(Y) or ¢*(Y)) in A[Y] is said to be a pro- 
jective (resp: subvectorial or vectorial) q-polynomial of g-prodegree 
(resp: g-subdegree or q-degree) m* (where m* > 0 is an integer) in Y 
with coefficients in K if it is of the form f*(Y) = pee avy (™-1-1) (resp: 
oY) = Marve"! or ot(¥) = OM aye”) with af © K for 
alli and aj # 0. The phrase “of qg-prodegree (resp: q-subdegree or q-degree) 
m* in Y with coefficients in K” may be dropped or may be abbreviated to 
something like “in Y over K.” Also the reference to gq may be dropped. 
Note that f*(Y) (resp: ¢*(Y) or ¢*(Y)) is monic © aj = 1, and note 
that f*(Y) (resp: 6"(Y) or $*(Y)) is separable (1.., its Y-discriminant is 
nonzero) = a},. # 0, and note that d)(Y) = ¢}(0) = a%.. where ¢}(Y) 
is the Y-derivative of ¢*(Y). Also note that f*(Y) — ¢*(Y) = f*(¥%"?) 
and @*(Y) > ¢*(Y) = Y¢*(Y) give byections of projectives to subvecto- 
rials (= their subvectorial associates) to vectorials (= their vectorial 
associates). 

To review what was said in Lemmas (2.4) and (2.5) of [A03] and Lemma 
(4.1.1) of [A08], for a moment let f = f(Y) be a separable projective 
q-polynomial of q-prodegree m over K, let ¢ = 6(Y) = f(Y%~") and ¢ = 
o(Y) = Y¢(Y), and let V be the set of all roots of ¢ in 2, and note that then 
V is an m-dimensional GF(q)-vector-subspace of 2; to see this, it suffices to 
observe that the cardinality of V is g™ and for all y.z in Q and ¢ € GF(q) 
we have (y+ z) = o(y) + o(z) and $(Cz) = Co(z). Let V be the set of all 
roots of f in Q. Then V \ {0} is the set of all roots of ¢ in 2, and yr y!?—! 
gives a surjective map V \{0} — V whose fibers are punctured 1-spaces, i.e., 
l-spaces minus the zero vector. So we may identify V with the projective 
space associated with V. In particular, fixing 0 # y € V and letting y’ 
vary over all elements of V with y/!~! = y!~! we see that y//y € A(V) 
varies over all nonzero elements of GF(q), and hence GF(q) C K(V) = 
SF(¢,K) = SF(¢, KA’). It follows that any g € Gal(K(V), A) induces an 
automorphism g’ of GF(q), and for all z € V and ¢ € GF(q) we clearly have 
g(Cz) = g/(C)g(z); since g is clearly additive on V, we see that g induces 
on V a semilinear transformation, i.e., an element of TL(V) = T'L(m,4q), 
and moreover this element belongs to GL(V) = GL(m,q) © g’ is identity. 
Thus in a natural manner Gal(¢, K) < T'L(m,q). Clearly g’ is identity for 
all g € Gal(K(V), K) < GF(q) C K, and hence in the above identification 
Gal(¢, K) < GL(m,q) & GF(q) C K. Thus we have the following: 


Semilinearity Lemma (1.1). Let f = f(Y) be @ separable projective q- 
polynomial of q-prodegree m in Y over K, let ¢ = (Y) = f(¥97!) and 
d= (Y) = Y¢@(Y), and let V be the set of all roots of ¢ in Q. Then 
V ts an m-dimensional GF(q)-vector-subspace of Q with GF(q) C K(V) = 
SF(¢,K) = SF(¢, K), and in a natural manner we may identify Gal(¢, K) 
with a subgroup of L(V) = [L(m,q); under this identification we have 


ABHYANNAR: Galois theory of semilinear transformations 3 


Gal(¢, K) < GL(m,q) > GF(q) CK. Likewise, we may identify Gall f, K) 
with a subgroup of PPL(m,q) and then Galf,K) becomes the wmage of 
Gal(¢,K) under the canonical epimorphism of TL(m,q) onto PI L(m,q). 
The Galois group Gallo, K) essentially equals the Galois group Gal(¢, K) 
except that the former acts on nonzero vectors while the latter acts on the 
entire vector space V. 


This lemma will be used tacitly. In particular, the said Galois groups will 
be regarded as subgroups of TL(V) = [L(m, 4) and its projectivization. In 
Section 2 we shall deal with vectorials whose Galois groups are between 
SL(m,q) and ['L(m,q); this will be based on [A08]. In Section 3 we shall 
deal with iterates of some of the vectorials considered in Section 2; this will 
be based on [AS1]. In Section 4 we shall deal with vectorials whose Galois 
groups are between Sp(2m,q) and T'Sp(2m, q); this will be based on [A04), 
[AL1] and [AL2]. For relevant general discussion about Galois Theory, see 
[AQ1], [A02] and [A07]. As a supplement to (1.1), in (2.5)(iii) of [A03] we 
proved the following: 


Root Extraction Lemma (1.2). Given any monic subvectorial q-poly- 
nomial @ = $(Y ) of q-subdegree m in Y over K, there exists A € SF(¢, K) 
such that A9-! = (—1)("—1) 4(0). 


When GF(q) C K, the Galois groups of the vectorials over K to be 
considered in Section 2 will be between SL(m,q) and GL(m,q). Note 
that SL(m,q) « GL(m,q) with GL(m,q)/SL(m,q) = Z,-1 and hence for 
every divisor d of g — 1 there is a unique group GL'%(m,q) such that 
SL(m,q) < GL‘! (m,q) < GL(m,q) and [GL(m,q) : GL“(m,q)] = d 
where, as usual, < and « denote subgroup and normal subgroup respec- 
tively, Z,-; denotes a cyclic group of order g — 1, and : denotes index. 
Upon letting PGL)(m, q) to be the image of GL (m,q) under the canon- 
ical epimorphism of GL(m,q¢) onto PGL(m,q) we see that PGL‘?)(m, q) is 
the unique group between PSL(m, q) and PGL(m, q) such that [PGL(m, q) : 
PGL')(m, q)] = GCD(m, 4). 

Likewise GL(m, q)<I'L(m, ¢g) with PL(m, ¢)/GL(m,q) = Z, and hence for 
every divisor 6 of u there is a unique group ['Ls(m, q) such that GL(m,q) < 
PLs(m,q) < TL(m,q) and [[Ls(m,q) : GL(m, q)] = 6, where PPLs(m, q) is 
the image of [Ls(m,q) under the canonical epimorphism of TL(m, ¢) onto 
PP'L(m,q). Also we let [SL5(m, q) be the set of all subgroups I of Ls(m, q) 
such that ITM GL(m,q) = SL(m,q) «J with I/SL(m,q) = Zs, and we let 
PLSLs(m,q@) be the set of images of the various members of [SLs5(m,q) 
under the canonical epimorphism of TL(m,q) onto PI'L(m, gq); in Remark 
(4.4.1) of [A08] we have shown that [SLs(m,q) is a nonempty complete set 
of conjugate subgroups of TL(m,q), and every I in [SLs(m,q) is a split 
extension of SL(m,q) (i.e., some subgroup of J is mapped isomorphically 
onto I/SL(m,q) by the residue class map of J onto [/SL(m,q)) such that 
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T'Ls(m, q) is generated by GL(m, q) and J. Finally we let ['L' om, q) be the 
set of all subgroups J of [Ls(m,q) such that JNGL(m,q) = GLO (m, q)aJ 
with J/GL‘)(m,q) = Zs and I < J for some I in TSL5(m,q), ae we let 
PrL\”(m, q) be the set of images of the various members of TL Dm ,q) 
under the canonical epimorphism of [L(m,q) onto PI'L(m,q); in Remark 
(4.4.1) of [A08] we have shown that PL? (m, q) is a pape a! complete 
set of conjugate subgroups of ['L(m,gq), and every J in WS ae (m,q) is a 
split extension of GL‘4)(m,q) such that [Ls(m, q) is generated by GL(m, q) 
and J; note that clearly ri Yum, q) = TSLs(m,q) and ri Yim, q) = 
{TL5(m, q)}. 

To determine the Galois groups when GF(q) is not contained in K, we 
note that SF(Y? — Y, AK) = K(GF(q)) and we let 6(K) be the unique 
divisor of u such that 
(1.3) 

Gal(Y! —Y,K) = ZK) i. equivalently [K(GF(q)) : K] = 6(K) 


and we note that then (see Footnote 17 of [A08}) 
(1.4) K NGE(q) = GF(p"/5*)). 


Concerning 6(/.), the following lemma is easily proved; see Propositions 
(4.2.3) to (4.2.5) of [A08]. 


Linear Enlargement Lemma (1.5). For any separable projective q-poly- 
nomial f = f(Y) of q-prodegreem inY over K and its subvectorial associate 
¢=¢(Y) = f(Y9~!) we have the following. 

(1.5.1) If Galo, K(GF(q))) = SL(m,q), then Gad, K) € TSL5K)(m, gq) 
and Gal(f,K) € PY SL5(K\(m,q). 

(1.5.2) If Gal(¢, K(GF(q))) = GL(m,q), then Gal(¢, K) = TLsx)(m,q) 
and Gal(f, K) = PP Ls ~)(m,q). 

(1.5.3) If Gal(¢, K(GF(q))) = GL (m,q) where d is a divisor of q —1, 
and for some field K' between K and SF(¢,K) we have 6(K’) = 6(K) 
and Gal(¢, K'(GF(q))) = SL(m,q), then Gald,K) € TL..(m,q) and 


5(K) 
Gal f,K)€ PLE | (m,q). 


In determining AC(k,,@, K) we shall use the following obvious: 


Algebraic Closure Lemma (1.6). Just in this lemma let kp C K CQ be 
fields of any characteristic, which may or may not be zero, such that Q is an 
algebraic closure of K. Let 6 = ¢(Y) be a nonconstant separable polynomial 
wn Y with coefficients in K, and let k* be an algebraic field ertension of 
kp in SF(¢,K) such that for every finite algebraic field extension k’ of k* 
in SF(o,K) we have [K(k’) : K(k*)] = [k’ : k*] and |Gal(d, K(k’))| = 
|\Galo, K(k*))|. Then AC ky; 0,K)-= k*. 


As a matter of terminology, we recall that a (noetherian) local ring S’ is 
said to dominate a local ring S if S is a subring of S’ and the maximal 
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ideal M(S) of S is contained in the maximal ideal M(S’) of S’, and we note 
that then the residue field $/M(S) of S may be identified with a subfield 
of the residue field S'/M(S’) of S‘; if under this identification, S/M(S) 
coincides with S’/M(S") then S’ is said to be residually rational over S: 
thus in particular S’ is residually rational over a subfield means that the 
subfield gets mapped isomorphically onto S’/M(S’) under the canonical 
epimorphism S’ — S'/M(S’). 

It is a pleasure to thank Paul Loomis and Ganesh Sundaram for stimu- 
lating conversations concerning the material of this paper. 


Section 2: Linear Groups 
In this Section, to write down families of polynomials whose Galois groups 
are between SL(m,q) and [L(m,q), let Y,X,7,,T2,... be indeterminates 
over ky. For every e > 0 let 


i. = RASA oczigd ad 
and 


K, =the quotient field of an (e + 1)-dimensional regular local 
domain R, with kp C R, and M(R.) = (X,T),...,Te)Re 


and for every e > 1 and 0 #7 € k,(7}) let 


Ker) = kp(X, Te, To, ore ile) 


We shall apply the considerations of Section 1 by taking K = K, or K;, or 
K(e,r) with suitable e and r. 

First, for 0 < e < m—1, consider the monic separable projective q- 
polynomial 


é 
ei er ee 
i=] 


of g-prodegree m in Y over K,, and its subvectorial associate 


é 
os =e (Y) =f yarn t+ x4) Gye! 


spam 
and, for every divisor d of g — 1, let fr and ra be obtained by substi- 
tuting (STs for X in f** and $** respectively, i.e., let 


f2'? = FLOM (Y) _ y (m-1) i (age at Sony 


#=1 
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and ,; 
prt4) = oY) = yermt 4-144 SO Ge 
2— 1, 


Next, for 1 < e < m—1 and every 0 4 7 € kp(T}) let fr"_, and 97 be 


(e,7) * T) 


obtained by substituting 7 for JT, in f** and ¢** respectively, i.e., let 


fie, Steal +X 4rvV 45 Hye 
7=2 


and 
Per) = Ore aiY) =YV aya ryiela pee ake: 
12 


abe 


and, for every divisor d of ¢ — 1. let fe , and gn, be obtained by substi- 


tuting (=1)%"- )X4¢ for X in Fair) ae oe respectively, 1.¢., let 


(7) 


pk — i Y)= yim-l) ne (ayia Uyxe 4+7Y ee Aue 
t=2 


and 


ey = Hen) HY + (ple xt gry 4 So nye 


Pre, T) 
w=2 


Finally, for 1 < e < m—1 and every 0 # 7 € kp(T;) let Seer) and $F..,) 


be obtained by substituting ((—1)(™~-")r9-!, X) for (X,7;) in f2* and ¢** 
respectively, i.e., let 


é€ 
Seis fen (¥) = Vi"? ees + xv + Te? 
i=2 


and 


fete Fe jO)= YP OE re ye +) nye. 


t=2 


Concerning these polynomials, by MRT (= the Method of Ramification The- 
ory) and MTR (= the Method of Throwing Away Roots), supplemented by 
Theorem I of [CaK] which we restate as Theorem (2.1*) below, in Theorems 


(2.3.1) to (2.3.5) of [A08] we respectively proved parts (2.1.1) to (2.1.5) of 
the following Theorem (2.1). 
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Theorem (2.1*) [Cameron-Kantor]. [fm > 2 and H < Gl(m,q) is 
such that its image under the canonical epimorphism of GL(m,q) onto 
PGL(m,q) ts doubly transitive, then erther SL(m,q) < H, or (q,m) = (4,2) 
with A7 » H < SL(4,2) = GL(4,2) & Ag (where x denotes isomorphism, 
and Az and Ag are the alternating groups on 7 and 8 letters respectively). 


Theorem (2.1). Forl1<e<m-—1 we have the following. 

(2.1.1) If GF(q) C kp, then for every element 0 # r € kp(Ti) we have 
Gal 97.7): K(e,r)) = SL(m, gq). 

(2.1.2) If GF(q) C kp, then for every element 0 # rT € kp(Ti) we have 
Gal $72 ,): K¢e,r)) = GL(m, q). 

(2.1.3) If GF(q) C kp, then for every integere>e we have Gal o**, K,) = 
GL(m, q). 

(2.1.4) If GF(q) C kp, then for every element 0 # rT € ky(T1) and every 
divisor d of q—1 we have Galo, Kier) = GL) (m, q). 

(2.1.5) If GF(q) C kp, then for every integer € > e and every divisor d 
of q—1 we have Galo. , K.) = GL‘ (m,q). 


By using the Algebraic Closure Lemma (1.6), we shall now deduce the 
following consequences of the above Theorem. 


Theorem (2.2). For1<e<m-—1 we have the following. 

(2.2.1) For every element 0 # r € kp(T) we have AC(kp, Pier) Ker) = 
kp(GF(q)). 

(2.2.2) For every element 0 # r € kp(T1) we have AC(kp, Per) Ker) = 
kp(GF(q)). 

(2.2.3) If € > e is any integer such that R, is residually rational over ky, 
then we have AC(kp, $2", K-) = kp(GF(q)). 

(2.2.4) For every element 0 # 7 € ky(T;) and every divisor d of q — 1, 
we have AC(kp, 6/6), K(e.r)) = kp(GF(q)). 

(2.2.5) If € > e is any integer such that R, is residually rational over kp, 
then for every divisor d of g—1 we have AC(kp, oo), Be) = hel GEG): 


To prove (2.2.1) or (2.2.2) or (2.2.4), let 1 <e <m-—1land047€ k(7}) 
be given, and respectively let (¢,G) = ($(.,r), SL(m, g)) or (O77 ,), GL(m, q)) 
or (e(0%,, GLO (m, q)) where in the last case d is any divisor of g—1. Upon 
letting K = K,,.,) and k* = ky(GF(q)), by (1.1) we see that k* C SF(¢, K). 
Now we have K(k") = k*(X,7,T»,...,Te) with r € k*(T,) and GF(q) C k*, 
and given any finite algebraic field extension k’ of k* in SF(¢, A) we also 
have K(k’) = k’(X,17,T2,...,Te) with r € k'(T,) and GF(q) C k’, and hence 
respectively by (2.1.1) or (2.1.2) or (2.1.4) we see that Gal(¢, K(k’)) =G= 
Gal(¢, K(k*)). For any finite algebraic field extension k’ of k* in SF(¢, A) 
we clearly have [A’(k’) : K(k*)] = [k’ : k*]. Therefore by (1.6) we conclude 
that AC(kp,¢, K) = k*. 
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To prove (2.2.3) or (2.2.5), let 1 < r < m-—1 and « > e be given, 
and respectively let (¢,G) = (¢7*,GL(m, q¢)) or (@ 4) ,GL')(m, q)) where 
in the second case d is any divisor of g — ie “toek letting A = K, and 
k* = k,(GF(q)), by (1.1) we see that kX C SF(¢, A). Moreover, upon 
letting R* to be the localization of the integral closure of R, in A(k*) 
at a maximal ideal in it we see that R? is an (€ + 1)-dimensional regular 
local domain whose maximal ideal is generated by (X,7),...,7;) and whose 
quotient field is A(k*), and we clearly have GF(g) C A(k*), and given 
any finite algebraic field extension k’ of k* in SF(¢,K), upon letting Ri 
to be the localization of the integral closure of R* in K(k’) at a maximal 
ideal in it we see that R! is an (€ + 1)-dimensional regular local domain 
whose maximal idea] is generated by (X,7,,...,7-) and whose quotient 
field is K(k’), and we clearly have GF(q) C K(k’), and hence respectively 
by (2.1.3) or (2.1.5) we see that Gal(¢, A(k’)) = G = Gal(¢, K(k*)). Now, 
assuming R, to be residually rational over k,, we see that R? is the inteeral 
closure of R, in K(k"), and R® is residually rational over &*, and given any 
finite algebraic field extension k’ of k* in SF(¢, A’), we see that Ri is the 
integral closure of R* in A(k’), and Rt is residually rational over k’, and 
also [K(k’) : K(k*)] = [k’ : k*]. Therefore again by (1.6) we conclude that 
AC(kp,¢,K) = k*. 


In Theorems (4.3.1) to (4.3.5) of [A08] we deduced the following conse- 
quences of parts (2.1.1) to (2.1.5) of the above Theorem (2.1) together with 
the Linear Enlargement Lemma (1.5). 


Theorem (2.3). For 1 <e<m-—1 we have the following. 

(2.3.1) For every element 0 #7 € ky(T1), upon letting 6 = 6(k,), we have 
Gal(or, 7) Ke,r)) ETSLs(m,q) and Gall f? (et 7 K(e,r)) € PVSL5(m,q). 

(2.3.2) For every element 0 # Tr € ky(T), upon letting 6 = d(kp), we 
have Gal( oi? ,), K(e,r)) =TLs5(m,q) and Gal ft", Reeeih= PT Ls(m,q). 

(2.3.3) for every integer € > eé, upon letting 6 = 
6(K.), we have Gal(o%*, K.) = TL5(m,q) and Gall f**, K.) = PT Ls(m,q). 
[Note that if either R, = kp{[X,Ti,...,T.]] or Re = the localization of 
Rel Xs Tei s 4 el at Whe ee ae adeal generated by (X,7\,...,7,) then R, 
ts restdually rational over ky and we have 6(K.) = 6(k,).) 

(2.3.4) For every element 0 # 7 € oe and every divisor d of q — 
1, upon letting 6 = d(kp), we have Gal(¢; as ye rt Dim, q) and 
Gall FO , Kye.) € PLL (m,q). 

(2.3.5) For every integer ¢ > € and every divisor d of q—1, upon let- 
ting 6 = 6(K,), we have Gaol, Ke) € PL‘ (m,q) and Gal (2, K K.)€ 
PLL (m,q). [Note that if either R. = kp[[X,T),..-,Te]] or Re = the local- 
wation of ky[X,T,,...,T.] at the ania ideal generated by (X,T),...,T:) 
then Re ts residually rational over ky and we have Oh) = 6(ky).] 
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Remark (2.4) [Local Surface Coverings]. 

(2.4.1). For m > 1 = e we get the trinomials f7* = Y("- 4 Ty 4+ 
xX and ¢] = yore a. T;Y%+ XY, giving local coverings above a normal 
crossing of the branch locus in the local (X,7;)-plane, dealt with in [A07] 
and [A08]; this is particularly significant with Ry = k,[[X,7\]]; the above 
Theorems (2.2.3), (2.2.5), (2.3.3) and (2.3.5) give generalizations for the 
local (e + 1)-dimensional space; the following Theorems (2.4.3) and (2.4.5) 
are ar cases of this. For m > 1 = e and r = 1 we get the trinomials 
fi, 1) yim“) + XY +(-1'"—9) and of, 1 = YO + XVI + (-1I™-Yy 
giving unramified coverings of the affine line, and the trinomials fli = 
y(m™-1) 4 y4 X and OGiy=Y a 4+ Y¢4 XY giving unramified coverings 
of the once punctured affine line, dealt with in [A03] and [A08). 


Remembering that now m > 0 is any integer, we conclude with the 
following consequences of the above theorems: 


(2.4.2). We have Galon*_,, Km-1) =TLs(m,q) and Gal f**_,, Km—1) = 
PTL s(m,q) where d=d(ky), and we have AC(ky, O7*_1, Km-1) =hp(GF(q)). 


(2.4.3). We have Galott_,, Km_1) =TLs(m,q) and Gal fx*_,, Km_1) = 
PL'Ls(m,q) where 6 = 6(Km_1), and moreover tf Rm—1 18 residually ratio- 
nal over kp then we have AC(kp, O77_;, Km-1) = kp(GF(q)). [Note that if 
etther Rm_1 = kp[[X,T1,...,Tm-i]] or Rm_-1i = the localization of 
kp[X,T1,...,Tm-—1] at the marimal ideal generated by OXg Figeiey baat) 
then Rm_1 is residually rational over kp and we have 6(Km_—1) = 6(kp).] 
(2.4.4). We have Galo, , Km_1) € PLO (m,q) and Gal fe), Km—1) € 


m—1) m 


PTL (m, q) where d is any divisor of q—1 and 6 = d(kp), and we have 
AC kp, On), Km—1) = kp(GF(q)). 


(2.4. 5). We have Gal(o7\* oe 1):€ re (m,q) and Gal 7, Km-1) € 


Pr iS?) (m,q) where d is any divisor of g—1 and 6 = 6(Km_1), and moreover 
if Rm is residually rational over ky then we have AC(kp O00, Keeak= 
kp(GF(q)). [Note that if either Rm— 1 = tolls Lijeuts 2 metlh OT timate 
the 2 of kp[X,T1,...,Tm-1] at the maximal ideal generated by 


A Ppeciale 1) hon Rm. i residually rational over kp and we have 
6(Km-1) = 6(kp).] 


Namely, everything except the assertions about AC was noted as Theo- 
tems (4.4.2) to (4.4.5) of [A08]. For m > 1, the assertions about AC are 
special cases of Theorems (2.2.2) to (2.2.5) respectively. For m = 1, it is easy 
to see that if GF(q) C k, then Gal(¢%", Ko) = GL(1,q) = Gal(¢j”, Ko) and 
Gal(¢3', Ko) = GL@(1,q) = Gal($5, Ko) for every divisor d of q — 1, 
and from this the assertions about AC follow as in the proofs of Theorems 
(2.2.2) to (2.2.5). 
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Note (2.5) [From Local Surface Coverings to Affine Line Cov- 
erings]. As hinted in (2.4.1), the family of projective polynomials f** was 
generalized from the m > 1 = e case with Ro = kp[[X,T1]] when it is 
reduced to the trinomial f7* = Y'"-!) + T,Y +X, giving a local cover- 
ing above a normal crossing of the branch locus in the local (X, 7} )-plane, 
dealt with in [A07] and [A08]. Likewise, the families of eae poly- 
nomials f7,",) and ue ,) were generalized from the m > 1 = e = T case 


when they are reduced to the trinomials f7, ,) = Y WRSPP XY 4+ (—1)09-0) 


and f/y = y(m—1) + y + X, giving unramified coverings of the affine 
line and the once punctured affine line respectively, dealt with in [A03] and 


[A08]. Out of this, the m = 2 and q = p case of Fiiay) Le., the trinomial 


Y!tP + XY +1, corresponds to the t = 1 case of the family of trinomials 
yrtt + XY'4+ 1, where ¢ is a positive integer prime to p, giving unramified 
coverings of the affine line, which was our starting point in [AQ1] and [A02]. 


Section 3: Iterated Linear Groups 
In this Section, let 


(3.1) E=E(Y)=Y +e “with X;€K and X,, £0 


be a monic separable vectorial g-polynomial of g-degree m in Y over K, 
where the elements X1,...,X, need not be algebraically independent over 
k,. When we want to assume that the elements X1,..., Xm are algebraically 
independent over ky and K = kp(Xj,...,Xm), we may express this by 
saying that we are in the generic case. In the general (= not necessarily 
generic) case, let V be the set of all roots of EF in Q, and note that then V 
is an m-dimensional GF(q)-vector-subspace of 2. Let X14,...,Xm,1 be a 
GF(q)-basis of V. Then 


yes ae = II (Y¥ — Ai X11 - +++ AmXm,1) 
‘= (Ai, Am )JEGF(q)™ 
and hence 
(3.3) kp[X1, tae Keni S kp(GF(q))[X11, tee mal 
and 
(3.4) SF(E, K) = K(V) = K(GF(q))(X11,-..-, Xm). 


As noted in (1.1), we also have 


(3.5) Gal(E, K(GF(q))) < GL(V) = GL(m, q) 
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and 


(3.6) Gal(£, K) < TL(V) = TL(m,4q). 


In the generic case, by (3.3) and (3.4) we see that the elements X11,... 
ie, 1 are algebraically independent over k,(GF(q)) and we have SF(L,K)= 
K(V) = kp(GF(q))(X1,1,---;Xm,1), and chenee AC(kp, E, K) = k,(GF(q)) 
and by (3.2) we see that every GF(q)-linear automorphism of V corresponds 
to an element of Gal(#, K(GF(q))), ie., GL(m,q) < Gal(E, K(GF(q))), 
and therefore by (3.5) we get Gal( FE, K(GF(q))) = GL(m, q) and obviously 
5(kp) = 6(K) and hence, upon letting 6 = 6(kp), in view of (1.5.2) we 
conclude that Gal(#, A) = TLs(m,q). Thus: 


Theorem (3.7). In the generic case, the elements X11,...,Xm1 are 
algebraically independent over kp, and we have SF(E,K) = K(V) = 
kp(GF(q))(X1,1)--+)Xm,1) and AC(ky, E,K) = kp(GF(q)) and 
Gal(E, K(GF(q))) = GL(m,q), and upon letting 6 = &(kp) we also have 
6 = 6(K) and Gal E, K) =TLs5(m,q). 


Remark (3.8) [Polynomial and Power Series Invariants]. Thus, 
in the generic case, the splitting field of FE over the m variable rational 
function field K = k,(Xj,...,Xm) is the m variable rational function field 
KO) = K™(X11,...,Xm1) with k* = k,(GF(q)), the groups GL(m,q) and 
PLs(m,q) with 6 = 6(k,) act on KM and, for these actions, the fixed fields 
(= the fields of invariants) are K* = k* ber ...,;Xm) and K respectively. 
Since a polynomial ring is normal (= integrally alesed in its quotient field), 
in view of (3.2) we also see that the integral closure of the m variable polyno- 


mial ring kp[X1,..., Xm] in the Galois extension KI of its quotient field K 
is the m variable polynomial ring k*[Xj1,...,Xm.], the groups GL(m, q) 
and ['Ls(m,q) act on the ring k"[X,1,...,Nyn4] and, for these actions, the 
rings of invariants are the rings k*[X),... ,Xm] and kp[X1,..., Xm] respec- 


tively, where we recall that when a group G acts on a ring J, the ring of 
invariants is the subring J° = {z € J: o(z) =z foralla € G}. Likewise, 
upon letting R and Rl") to be the m-dimensional regular local domains 
obtained by localizing k PIA Tpescy Xen) ON: KX ay 0.4 km) at the-amax- 
imal ideals generated by. Xj,...,Xm and X14,...,Xm.1 respectively, and 
upon letting R* to be the m-dimensional regular local domain obtained by 
localizing k*|X1,...,Xm] at the maximal ideal generated by Xj,...,Xm; 
we see that RU is the integral closure of R in the Galois extension KW 
of its quotient field A’, the local ring RI!) dominates the local ring R, the 
groups GL(m,q) and ['Ls(m,q) act on the ring RU! and, for these actions, 
the rings of invariants are the rings R* and R respectively. Finally, upon 
letting R and RU] to be the completions of R and RI) respectively, and 
upon letting R* to be the completion of R*, we see that the m variable 
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formal power series ring RI] = k*[[X11,...,Xm.]] is the integral closure 
of the m variable formal power series ring R = kp[[X1,...,Xm]] in the m 
variable formal meromorphic series field Kl] = k*((X4,1,-.-,Xm,1)) which 


is the splitting field of E over the m variable formal meromorphic series field 
K = kp((X1,..., Xm)) with the corresponding Galois group [Ls(m,q), the 
ee ae q) and [Ls(m,q) act on the m variable formal power series 
ring RU) = k[[X14,...,Xm.i]] and, for these actions, the rings of invariants 
are the m variable formal power series rings R* = k*[[X1,...,Xm]] and 
R=k,[[Xi,...,Xm]] respectively. Note that, continuing with the generic 
case, in the notation of Section 2 we have FE = @f*_, with (X1,...,Xm) = 
(T1,...,Tm-1,X) and K = Km-_1, and so in Theorem (3.7) we have given 
a stand-alone proof of Theorem (2.4.2). 


Remark (3.9) [Converse of Linearity]. As a partial converse of (1.1), 
we observe that if E’ = E’(¥) is any monic polynomial of degree g™ in Y 
with coefficients in Q such that the set of all roots of E’ in Q is an m- 
dimensional GF(q)-vector subspace V’ of 2, then E’ must be a vectorial 
q-polynomial of g-degree m in Y over Q. This follows from the identity 
(3.2) by substituting a GF(q)-basis of V’ for X11,...,Xmi in the RHS 
and noting that then E’ must coincide with the LHS where X,...,Xp, 
are suitable polynomial expressions in Xj 1,...,Xm1 with coefficients in 


GF(q). 


Definition-Notation (3.10). Reverting to the general case, let n > 0 
be an integer, and let El"! be the n-th iterate of E, ie., let £1) = EWU(y) = 
Y, EY = gly) = E(Y), and EU] = Ely) = £(EU-(Y)) for all 
j > 1. Let Vl = {0} c V, let VU) = V, and for every j > 1 let 
VU) be the set of “ll roots of EU in ©, and note that EY) is a monic 
separable vectorial q-polynomial of g-degree mj in Y over K and hence 
VU) is an (mj )-dimensional GF(q)-vector-subspace of Q. We get a GF(q)- 
linear epimorphism EF : Q — 2 given by z } E(z); for its j-th power 
FE) :Q + Q, where j > 0 is any es we have EJ(z) = EW](z) for all 
z € Q; likewise, for any integers j > j’ > 0, we clearly have VY] c VU and 
EF (yl) = VU-7'], and the restriction of E? to VU] gives a GF(q)-linear 
epimorphism vl — yl-F) with kernel VU’). Let F be the image of T 
under the canonical epimorphism GF(q)[7] — GF(q,n) = GF(q)[T]/T". 
For every y € Aut(GF(q)) and F = LEA rj T € GF(q,n) with ry € GF(q), 


let us put y(F) = 37") ¥(r;)T’; this gives a faithful action of Aut(GF(q)) 
on GF(q,n). For every F € GF(q,n) and z € Q we define Fz € Q by 
putting fz = Rear rj E*(z), and we note that this makes VI" a GF(q,7)- 
module, and then, for every g € Gal(K(V!"]), K) we have g(Fz) = g' (F)g(z) 
where g' € Gal(GF(q), GF(p¥/**))) is given by putting 9 (6) = -9(C) for 


| all 
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all ¢ € GF(q), and hence in its action on GF(q,n) it is given by g/(F) = 
, ey g(ri \T € GF(q, 7) for all r € GF(q)[T]. It follows that, in a natural 
manner, Gal( El), KK) is a subgroup of PLsK)(V") where, for any factor 
6 of u, by TLs(V'"!) we denote the group of all GF5(q,n)-semilinear 
automorphisms of the module yr by which we mean all additive jso- 
morphisms o : Vn] = Vl") for which there exists o’ € Gal(GF(q), GF(p"/*)) 
such that for all 7 € GF(q,n) and z € Viel we have o(nz) = a'(n)o(z); note 
that now @ + a’ gives an epimorphism T'Ls(V!"]) — Gal(GF(q), GF(p*/°)) 
whose kernel is the group GL(V!")) of all GF(q, n)-linear automorphism 
of Vin], by which we mean all additive isomorphisms o : Vit] — yl 
such that for all 7 € GF(q,n) and z € V'"l we have o(nz) = no(z), Le., 
GL(vinl) = TL,(V!7]). Observe that [L5(V'")) is a normal subgroup of 
the group [L(V!")) of all GF(q,n)-semilinear automorphisms of the 
module VI"), by which we mean all additive isomorphisms o : yin) yr) 
for which there exists o’ € Aut(GF(q)) such that for all 7 € GF(q,n) 
and z € VI"! we have o(nz) = o'(n)o(z), ie., TL(virl) = PEC ly: 
note that now o ++ o’ gives an epimorphism [L(V"1) — Aut(GF(q)) = 
Gal(GF(q), GF(p)) whose kernel is the group GL(V!"]) and which is an ex- 
tension of the above epimorphism ['Ls(V!"J) — Gal(GF(q), GF(p"/*)); also 
note that TL(V!"})/TL;(V!) = Z,/5 and TLs(VI"1)/GL(VI"]) = Zs. Now 
identifying Galois groups of polynomials with the Galois groups of their 
splitting fields we see that Gal(E!"], K) NGL(V!"]) = Gal(E!™, K(GF(q))), 
and by the usual Galois correspondence we have Gal(El"], K(GF(q))) 4 
Gal(E'™!, K) with Gal(E'™, K)/Gal(El™, K(GF(q))) = Gal(K(GF(q)), K) 


= Z5(K): Thus 
3.11) Gal(E™), K) < TLhaxy(V") aPL(vir)) 
with PL(VIM)/PLscK)(V}) = Zu sc) 
and 
(3.12) f Gale), kK) GL(Vi") = Gal(EM), K(GF(q))) «Gal(Z™), K) 
( with Gal(£l, K)/Gal(l"), K(GF(q))) = Zee) 
and 
(3.13) oe ieee eae 
with DLs K)(Vi)/GL(ve"l) = Z5(K): 


Now by (3.12) we see that 


(3.14) Gal(K(V™")), K) < GL(VI")) & GF(q) C K 
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and by (3.11) to (3.13) we see that 
{ if Gal(El"], K(GF(q))) = GL(V"")) 


3.15) 
gs then Gal( £17), AyS Phx) (V i). 


Definition- Notation (3.16). Continuing with (3.10), in the general 
case, we can first take a GF(q)-basis X)1,...,Xm 1 of V, and then iter- 
atively we can take elements X,;,...,Xm,j in Q such that for all 7 > 
1 we have E(X1;) = Xijzer,-..,E(Xmj) = Xmj-1. It follows that 
Xiny.++;Xmn 18s a free GF(q,n)-basis of Vil, and hence VI"! is a free 
GF(q,n)-module of rank m. Therefore we may identify GL(vlrl) with the 
group GL(m,q,n) of all m by m invertible matrices over GF(q,7). 
Likewise we may identify TL(V!"J) with the semidirect product [L(m, q,n) 
= GL(m,q,n) x Aut(GF(q)) where Aut(GF(q)) acts on GL(m, ¢,n) by com- 
ponentwise application of the above explained action of Aut(GF(q)) on 
GF(q,n). For any factor 6 of u, this identifies PLs(V'")) with the semidirect 
product [Ls(m,q,n) = GL(m,q,n) » Gal(GF(q), GF(p"/*)). Thus 


(3.17) GL(VI"]) = GL(m,q,n) 

and 

(3.18) L(V") = PL(m,q,n) = GL(m, q,n) « Aut(GF(q)) 
and 


TLscc)(V") = Phsccy(m, 9,7) 
= GL(m,q,n) » Gal(GF(q), GF(p*/"*))). 
Elements of GL(m,q,n) can be identified with expressions of the form £) + 


€T+---+&,-17 with é in GL(m,q) and &,...,,—1 in the set of all 
m by m matrices over GF(q), and hence 

(3.20) |GL(m, g,n)| = |GL(m, a) lg. 

As an alternative proof of (3.20), we may note that |GL(m,gq,n)| equals 
the cardinality of the set of all GF(q,n)-bases of GF(g,n)™, and for any 
finite sequence (y),...,y,) in D” where D is any local ring, upon letting 
(¥,,...,¥,) to be the corresponding sequence in D” where D = D/M(D), 
by Nakayama’s Lemma we see that (y,...,y,) is a D-basis of D” ¢ 
(¥,,-..,9,) isa D-basis of D” ; now it suffices to observe that | (GF(q,n))| 
= q'"~!), Since Aut(GF(q)) = Zy, and Gal(GF(q), GF(p4/*\*))) = Z5(K)) 


we also have 


(3.19) 


(3.21) PL(m, ¢,n)| = |GL(m, 4,2) 
and 
(3.22) ITLsxy(m,9,n)| = |GL(m,q,n)|6(K). 


In the proof of Proposition (5.3) of [A06], the following easy-to-prove fact 
was called: 


- 
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Generalized Eisenstein Criterion (3.23). Let B be a local flees dom- 
inated by an m-dimenstonal regular local domain By, let (Ci1,...,Cm1) be 
a basis of M(B1), and let A be an algebraic closure of the vapbent field L, 
of Bi. Let F(Y) = =Y%+ ae Ary?! with Ai € M(B) for1<l<d and 
Aa € M(B)? where d > 0 ts any integer. Then F(Y)—Cj,1 ts arreducible in 
L,[¥] and, upon taking C’€ A with F(C’) = C\1 and upon letting B' = the 
integral closure of B, in L1(C’), we have that B! is an m-dimensional reg- 
ular local domain with M(B’) = (C’,Co1,...,Cm1)B’, and B' dominates 
B, and is residually rational over tt. 


By double induction on z and j, or simple induction on i+ mj, as an 
immediate consequence of (3.23) we get the following: 


Corollary (3.24). Let B be a local domain dominated by an m-dimensional 
regular local domain By, let (Ci1,...,Cm1) be @ basis of M(B,), and let 
A be an algebraic closure of the quotient field Ly of By. For1<i<m and 
ee pen, tet a a = y49) + ee he nnagy Oy with Aj jl € M(B) 
for 1 <1 <:@G,j) and Ajj 43,3) € M(B)? where d(i,j) > 0 is any inte- 
ger, and let Cj,j € A be such that F, j(C;;) = Cijii. For 1 <3, S.n, let 
Ly = Ly(Cij,---,Cmj) and B; = the integral closure of By in L;. For 
Os esunend |< 4 <a. let Dig by a (Cr 5,0, Cig) and: By =the 
integral closure of B, in L;;. [Note that now for 1 < j <n we obviously 
have Log — Lj-1 and Boj = Beo4 and Lin,j = L; = Bea Cy » \e hi Cra5) 
and By; = B; = the integral closure of B;-, in L;, and forl1<i<m 
andl <j<n we obviously have Lj; = Li-1,j(Ci;) and B;; = the integral 
closure of B;_1,; in L;;.] Then for1<t<mand1 <j <n, the poly- 
nomial Fi j(Y) — Cy j-1 ts wrreducable in Lj_-1;[Y] (i.€., equevalently [L;; : 
ese qi, j)), and we have that B;; 1s an m-dimensional regular local 
domain with M(B;,;) = (Cis Shey Cig Orn one ee Cm j=1 )Bi 5; and B; ; 
dominates B;_,; and 1s residually rational over it. (e) [Hence, in particu- 
lar, for 1 <j <n, the polynomials PY \—Cigedyesj ln gl) ) — Gage 
ae irreducible in Lj_i[Y], the field degree [L; : L;_-1] equals the product 

d(1,j)...d(m, j), and we have that B; ts an m-dimensional regular local 
ae uith M(B;) = (C1j,.-.,Cmj)B;, and B; dominates B;_, and 1s 
residually rational over it.] 


To apply (3.24) tothe generic case with notation as in (3.8), take 
(By Bi, L1,A,Ci1,-..§Cma) = (R, RY, KU), OMasers, Xmi), and for 
gi Sem and 1 <oj. <n take Fy = B and Cope Xe5- For 1<jS 7 
let KU) ~ Lj; and RU) = B;. Now K H(vlel) = Kl"l and [Alnl KU] = 
Ihe; Sal +i UT. J KG | and Rate by the last bracketed part of (3.24) we see 
that [KEI(yirl ae =qg™("-)) Since KC K* Cc KE KM vir), and 
KU and (vi ")) are the respective splitting fields of F and E™! over K, 
it follows that ai Ell, K*)| = |Gal(E, K*)|q™"-») and |Gal(Z", K)| = 
IGal(B, K)|g™ (-)), Therefore by (3.7), (3-11), (3.12), (3.17), (3.19) and 
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(3.20) we get Gal(El7], K*) = GL(m,q,n) and Gal(E™, K) = 
['Ls(m,q,n). Since Ri] is residually rational over RU, by (3.7) we also see 
that AC(k,, El"l, K) = k*. So we have proved the following: 


Theorem (3.25). In the generic case, the elements X11,...,Xm 1 are 
algebraically independent over ky, and we have SFE), K) = K(virl) = 
: GR Laisine; Melek asta x ee) and AC(kp, Bl"), K) 

kp(GF(q)) and Gal El], K(GF(q))) = GL(m,q,n), and upon letteng 6 
5 (kp) we also have 6 = 6(K) and Gal El"), K) = TLs(m,q,n). 


Remark (3.26) [Iterated Polynomial and Power Series Invari- 
ants]. Thus, in the generic case, the splitting field of El] over the m 
variable rational function field K = kp(X1,...,Xm) is the field Kil — 
BM ge neg han nas oe orn). tly k= ky (GE (q)),-the groups 
GL(m,q,n) and [Ls(m,q,n) with 6 = é(k,) act on K “C4 and, for these ac- 
tions, the fixed fields are K* = k*(X1,...,Xm) and K respectively. Upon 
letting R and R* to be the m dimensional regular local domains obtained by 
localizing the polynomial rings k,[X1,...,Xm] and k*[X1,...,Xm] at the 
maximal ideals generated by X,..., Xm respectively, in view of (3.8) and 
(3.24) we see that the integral closure R) of Rin El] is an m dimensional 
regular local domain with M(RP)) aa Cee ie Crees cia the local ring 
RI"] dominates the local ring R* and is residually rational over it, the groups 
GL(m,q,n) and PLs(m,q, n) act on the local ring RI"! and, for these actions, 
the rings of invariants are the local rings R* and R respectively. Likewise, 
upon letting R and R* to be the completions of R and R* respectively, and 
upon letting RI”! to be the completion of RI"], we see that the m variable 


I 


IH 


formal power series ring RI"! = k*{[X1n,...,Xmnl] is the integral closure of 
the m variable formal power series ring R = ky|[X1,...,Xm]] in the m vari- 
able formal meromorphic series field A!) = BU Xe agnteg ca) whichis 


the splitting field of E!”) over the m variable formal meromorphic series field 
K = kp((X1,...,Xm)) with the corresponding Galois group ['Ls(m,q,n), 
the groups GL(m, q,n) and [Ls(m,q,n) act on the m variable formal power 
series ring RI”) = k[[X1.n,...;Xm,n]] and, for these actions, the rings of in- 
variants are the m variable formal power series rings R* = k*[[Xy,..., Xml] 
and R= kp[[X1,...,Xm]] respectively. 


Definition-Notation (3.27). Continuing with the generic case, for 
any divisor d of q — 1, let E\*) be obtained by substituting Gas ae go 
for X,, in E, i.e., let 


m—1 
E(4@) — E®(y)=y2" +(-1) "xd y + Sc yy 
i=l 
and let £'l"] be the n-th iterate of E!”). ; Jet Oe peidyy) = ay 


POW = PANY) = ROY) and FOU) = “pial ly) = EO) (EoOU-Uy)) 
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for all j > 1. Now clearly E(4l") is a monic separable vectorial g-polynomial 
of g-degree mn in Y over Kk, and hence by me above displayed items (3.11), 
(3.12), (3.17) and (3.19) we see that Gal( FE! trl, K(GF(q))) < GL(m,q,n) 
mad Gal(EOM"!, K) < [Ls%)(m,q,n). To determine the exact values of 
these Galois groups, we first note that SL(m,q) equals the quasi-p part 
p(GL(m, q)) (see Footnote 14 of [A08]). Moreover SL(m, ¢)4GL(m, ¢) with 
GL(m, q))/SL(m, 9) = Z,_, and, by definition, GL!) (m, q) is the unique 
group between SL(m,q) and GL(m,q) with GL(m,q))/GL?(m,q) = Za. 
In the notation just before (3.20), by sending Eg HEVT ee: Bee ae to £5 
we get an epimorphism p : GL(m,q, n) — GL(m,q) whose kernel has order 
ap). Therefore, upon letting GL{?)(m, ¢,n) = p~!(GL‘(m, q)) we see 
that GL'?)(m,q,n) is the unique normal subgroup of GL(m,q,n) for which 
we have GL(m, q,n)/GL'(m,q,n) = Za, and upon letting SL(m,q,n)) = 
u-!(SL(m, q)) we see that p(GL(m, q,n)) = SL(m,q,n) = GLY-Y(m,q,n). 
Let us also note that GL'®(m,q,n) can alternatively be characterized as 
the unique normal subgroup of GL(m,q,n) for which we have SL(m, q,n)< 
GL@®(m,q,n) with GL (m,q,n)/SL(m,q,n) = Zig-i)/d- By (3.18) we 
have GL(m, q,n)<PL(m,q,n) with TL(m, q,n)/GL(m,q,n) = Zy, hence for 
every divisor 6 of u there is a unique group ['Ls(m,q, n) between GL(m, gq, n) 
and [L(m,q,n) with TLs(m,q,n)/GL(m,q,n) = Zs. Let TSLs(m,q,n) 
be the set of all subgroups 7 of [TLs(m,q,n) such that 1M GL(m,gq,n) = 
SL(m,q.n) <7 with [/SL(m,q,n) = Zs; in Remark (3.29) we shall show 
that [SL5(m,q,n) is a nonempty complete set of conjugate subgroups of 
[L(m,q,n), and every J in TSLs(m,q,n) is a split extension of SL(m, q,n) 
such that [Ls(m,q,n) is generated by SL(m,q,n) and J. Similarly, let 
TL) (m, g, n) be the set of all subgroups J of TLs(m,q,n) such that JN 
GL(m,q,n) = GL‘? (m,q,n) <J with J/GL(m,q,n) = Zs and I < J for 
some J in TSLs(m,q,7); mm Remark (3.29) we shall show that PL (m, q,7) 
is a nonempty complete set of conjugate subgroups of FL(m,gq,n), and 
every J in PLS (m, q,n) is a split extension of GL‘?)(m,q,n) such that 
['Ls(m,q,n) is generated by GL(m,q,n) and J. 

Note that clearly TLY—)\(m, q,n) = TSLs(m,q,n) and Phi) (m,q,n) = 
{TLs(m, q, n)}. As a consequence of Theorem (3.25) we shall now deduce 
the following generalization of it: 


Theorem (3.28). In the generic case, for any divisor d of q—1 , we 
have AC(kp, Ol", K) = ky(GF(q)) and =~ Gal BOI", K(GF(q))) 
= GL (m,q.n), and upon letting 6 = 6(kp) we also have 6 = 6(K) and 
GaK EO] K) PL‘ (m,q,n). 


To deduce this, we first recall that by (1.1) we have GF(q) C K(vlrl) = 
SF(Elr), Kk), and moreover by (1.2) we can find A € K(virl) with A’~! = 
(-1)(™-)X,,. Let k* = kp(GE(q)) and K* = K(GEF(q)). Now by (3.25) 
we have AC(k,, El"), K) = k* and Gal(El”), K*) = GL(m,q,n). Let Ag = 
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AG@-D/4_ Then clearly AC(kp, Bll, K(Ag)) = AC(ky, E™, K) = k*. Also 
we have the Galois extensions K* C K*(A)C K*(V")) and K* C K*(Ag) C 
K*(A) and therefore in view of the basic Galois correspondence we see that 
Gal( El"), K*(A))sGal( El"), K* (Aq))<Gal(E Ell, K*) and Gal(El), K*(A))q 
Gal( El"), K*) with Gal( Et), K*)/Gal(E'), K*(A)) = Gal(K*(A), K*) = 
Z,-1 and Gal( El"), k*)/Gal(E nA ,K*(Ag)) = Gal(K*(Ag), K*) = Zg. As 
indicated in (3.27), SL(m,q,n) and GL“)(m, q,n) are the only normal sub- 
groups of GL(m,q,n) of indices g — 1 and d respectively, and hence we get 
SL(m,q,n) = Gal( El), K*(A)) a Gal(El"], K*(Ag)) = GL (m,q,n). We 
can find a kp(X1,..., Xm_1)-automorphism v of Q such that v(Ag) = Xn. 
Now K = v(K(Aqg)) C v(K*(Ag)) = K* C K'™ = v(K*(A)) where 
K' = K'(GF(q)) with A’ = K(v(A)), and by applying v to the coefficients 
of Ell(y) we get E'")(¥'), and therefore AC(ky, Bl], K) = k* and 
SL(m,q,n) = Gal(E'"), Kk’) a Gal(B(l"] K*) = GL (m,q,n). Since 
K C K’, we clearly have Gal(El"], kK’) < Gal(El"], K). Let 6 = 6(kp). 
Then 6 = 6(K’) = 6(K’) and hence by (3.11), (3.12), (3.17) and (3.19) we 
see that 


rf Gal( EO) kK’) Nn GL(m, q,n n= Gal( Hi"), 7 Ke) q Gal(EOl") | K! ) 
with Gal(EO@"), K')/Gal( EO"), K'*) = Zs 
and Gal(E(9"l, K’) < TLs(m,q,n) 


and 


( Gal(EOl"l, K)N GL(m,q,n) = Gal(EOl], K*) « Gal( EO], K) 
{ with Gal(EOl*] K)/Gal(BOl!, K*) = Z5 
[ and Gal(El"] K) < PLs(m,q,n) 


and therefore we have Gal(E(@l"],K’) €TSLs(m,q,n) and 
Gal(E@l, kK) € PL“ (m, q,n). 


Remark (3.29) [Semilinearity]. To verify the properties of the sets 
[SLs(m,q,n) and PL” (m, q,n) asserted in (3.27) where d and 6 are any 
divisors of g — 1 and uw respectively, we first note that, as indicated in 
(3.18), the canonical epimorphism of ['L(m,q,n) onto Aut(GF(q)) = Z,y 
with kernel GL(m,q,n) splits; to see this, as on page 79 of [A02], we 
identify TL(m,q,n) with {(g,a) : g € Aut(GF(q)) and a € GL(m,q,n)}, 
and now upon letting ['(m,q,n) = {(g,1) : g € Aut(GF(q))} we see that 
I'(m,q,n) is a subgroup of PL(m, q, n) which is mapped isomorphically onto 
Aut(GF(q)) by the canonical epimorphism of FL(m,q,n) onto Aut(GF(q)). 
Let I'5(m, q,n) be the unique subgroup of I'(m, q,7n) of order 6. Then clearly 
P'Ls(m,q,n) is generated by P's(m,q,n) and GL(m,q,n). Upon letting 
Is(m,q,n) be the subgroup of ['Ls(m,q,n) generated by ['s(m,q,n) and 
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SL(m, 9,7) we see that Is(m,q,n) € [TSLs(m,q,n). Likewise, upon let- 
ting IP (m, 4, n) be the subgroup of [Ls(m,q,n) generated by I's5(m,q,n) 
and GL‘) (m,4q,n) we see that Is(m,q,n) < J (mq, n)é€ PL‘ (m,q,n). 
Let p : GL(m,q,n) > GL(m,q) be the epimorphism described in (3.27). 
Then (g,a) > (g,det p(a)) gives an epimorphism o of T'L{m,q,n) onto 
TL (1,4) with kernel SL(m,q,n), where we identify [L(1,q) with {(g,¢) : 
g € Aut(GF(q)) and ¢ € GF(q)* = GL(1,q)}. Let GL™(1,¢), PLs(1,q), 
TSLs(1,q) and rio (1, q) be as in Section 1, let T(1,qg) = {(g,1): 9 € 
Aut(GF(q))} < TL(1,q@), let T'5(1,q) be the unique subgroup of T'(1,q) of 
order 6, let Is(1,q) be the subgroup of T'Ls(1,q¢) generated by I's(1,q) and 
SL(1,q), and let Js(1,q) be the subgroup of ['Ls(1,q) generated by I's(1, q) 
and GL‘?)(1,q). Now o induces the usual bijection between the set of sub- 
groups of TL(m,q,n) containing SL(m,q,n) and the set of all subgroups 
of [L(1,q). In particular PL (m,q,n) SAS < VLG; gn) sos) 
ru? (1,¢)}. Also we have o(T'SLs{m,q,n)) = PSLs(1,q) and we have 
o( PLS (m, q,n))= TL, q). It is easy to see that TSL5(1, q) is the set of 
all conjugates of Js(1,q) in PL(1,q), and TL’? (1,¢) is the set of all conju- 
gates of Jad, q) in TL(1,q). {From this it follows that TSL5(m, q, 7) is the 
set of all conjugates of Is(m,q,n) in TL(m,q,n), and TL) (m, q, n) 1s the set 
of all conjugates of T(m, qg,n) in 'L(m,q,n). Clearly Is(m,q,n) is a split 
extension of SL(m, q,n) such that !'Ls(m, gq) is generated by GL(m, q,n) and 
Is(m,q,n), and likewise I? (m,qyn) is a split extension of GLO (m,q,n) 
such that TLs(m,q,n) is generated by GL(m,q,n) and J? (m,q,n). 
Therefore TSLs5(m,q,n) ts a@ nonempty complete set of conjugate 

subgroups of TL(m,q,n), and every I in PSLs(m,q,n) ts a split extension 
of SL(m,q,n) such that TLs(m,q,n) is generated by GL(m,q,n) and I, 
and likewise TL (m, qn) 1s a nonempty complete set of conjugate sub- 
groups of TL£(m,q,n), and every J in ri (m,q,n) is a split extension of 
GL (m, q,n) such that TL5(m,q,n) as generated by GL(m,q) and J. 


Remark (3.30) [Generalized Iteration]. Reverting to the general 
case, let us generalize the idea of the n-th iterate El") of E by intro- 
ducing the concept of the generalized r-th iterate El) of E for any 
r=r(T) = Sor,T? € Q(T] with r; € Q (and r; = 0 for all except a finite 
number of of 7), by putting Ell = Ebl(y) = >, r, E"l(Y). Note that, for the 
Y-derivative Evy) of El](Y) we clearly have EU) ce EW }(0) =r(Xm), 
and hence if r(Xm) # 0 then El"l is a separable vectorial g-polynomial over 
Q whose g-degree in Y equals m times the T-degree of r. Also note that the 
definition of El"l remains valid for any vectorial E without assuming it to 
be monic or separable. Moreover, in such a general set-up, this makes the 
additive group of all vectorial g-polynomials FE = E(Y) in Y over Q into a 
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Q([T]|-premodule having all the properties of a module except additivity in 
E, i.e., for all r’ € Q(T] we have Elttr'] = El 4 Bl) and Err’) = (eb lyr) 
but in general for a vectorial g-polynomial E’ over Q we need not have 
(E+E srl = ell + El, At any rate, El”) of the previous notation corre- 
sponds to El7"] in the present notation. Reverting to the fixed monic sepa- 
rable vectorial # exhibited in (3.1), the said premodule structure makes (2 
into a Q[T]-module when for every r € Q[T] and z € Q we define the “prod- 
uct” of r and z to be El’](z); in particular 2 becomes a GF(q)[T]-module 
and as such we denote it by Qg. Now let us fix s = s(T) € GF(p4*/"(*))[7) 
with s(Xm) # 0 and degys = n, and note that then Ell is a separable vecto- 
rial g-polynomial of g-degree mn in Y over K. Let V[*l be the set of all roots 
of Els] in Q, and note that then V!5] is an (mn)-dimensional GF(q)-vector- 
subspace of 2. Let GF(q, s) = GF(q)[T]/(s) where (s) is the ideal generated 
by s in GF(q)[T], and let & : GF(q)[T] — GF(q,s) be the canonical epi- 
morphism. For every y € Gal(GF(q), GF(p"/°*))) and r = Yor;T" € Q(T] 
with r; € GF(q), let us put y(w(r)) = w(S>y(ri)T"); this gives a faithful 
action of Gal(GF(q), GF(p"/**))) on GF(q,s). Now Vl] is a submodule 
of Qe and as such it is annihilated by (s) and hence we may regard it as a 
GF(q,s)-module; note that then, for every r € Q(T] and z € Q, the “prod- 
uct” of w(r) and z is given by w(r ine = cle (A= mH a (z), and for every 
g€ Gal(K(V™’), Kk) we have g(w(r)z) = 9° g(r; (z)) = g'(w(r))g(z) 
where g’ € mgr on is eae by ae g'(C) = g(C) for all 
ce Cr w and hence in its action on GF(q,s) it 1s given by g/(w(r)) = 

w(>°g(ri)T") for all r € GF(q)[T]. It follows that, in a natural manner, 
erason oe is a subgroup of ene OK where, for any factor 6 of 6(K), 
by P'Ls(V!5!) we denote the group of all GF;(q, s)-semilinear automor- 
phisms of the module V!*!, by which we mean all additive isomorphisms 
o : Vis) —. VIs] for which there exists o/ € Gal(GF(q). GF(p“/°)) such that 
for all 7 € GF(q,s) and z € V5! we have a(nz) = o'(n)o(z); note that now 
o —+ a’ gives an epimorphism TL ,(V/]) — Gal(GF(q),GF(p"/*)) whose 
kernel is the group GL(V!) of all GF(q,s)-linear automorphisms of 
Vis], by which we mean all additive isomorphisms ¢ : VIS] = Vil such that 
for all n € GF(q,s) and z € VIS) we have o(nz) = no(z), ie., GL(VE) = 
PL, (Vl), 

Also note that TLs(V'')/GL(VEl) = Zs and TLs(VEl) a Phe Ky (VE) 
with TLs«)(VEI)/PLs(VEl) = Zs js. Now identifying Galois groups of 
polynomials with the Galois groups of their splitting fields we see that 


Gal(El, K) n GL(VE)) = Gal(E"!, K(GF(q))) 
and by the usual Galois correspondence we have 


Gal( El’), K(GF(q))) «Gal(E"!, K) 


ABHYANKAR: Galois theory of semilinear transformations 21 


with 
Gal(E!, K)/Gal(E"!, K (GF(q))) = Gal(K(GF(q)), K) = Zscx)- 


Thus 
(3.31) Gal(ZEl, K) < TLsc«)(V") 


and 
Gal(E“!, K) nGL(VE) = Gal(Ell, K(GF(q))) « Gal(El, K) 
(3-32) 1 with Gal(Bl, K)/Gal(E", K(GF(q))) = Zscx) 


and 
f GL(VES)) a Phere) (VE!) 


(3.33 
os) i with Ths K)(VY)/GLVE) = Zax). 


Now by (3.32) we see that 
(3.34) Gal(E“!, K) < GL(V"]) & GF(q) C K 
and by (3.31) to (3.33) we see that 


ee f if Gal(B"), K(GF(q))) = GL(V") 
\ then Gal(E"l, K) = TLecx)(V")). 

Remark (3.36) [Moore Determinant]. In his path-breaking paper of 
1896, E. H. Moore [Mor] showed that, in the generic case and assuming 
GF(q) C kp, the Galois group of the vectorial q-polynomial F’ of q-degree 
m is GL(m,q). Our proof of Theorem (3.7) was inspired by Moore’s proof. 
To describe Moore’s original proof, let us consider the determinant 


Qm(Th, ve sos »Im41) = det(T¥, 1)i,j=0,1,...,m 


where i,j are respectively the row and column indices, and 7},...,7m41 
are indeterminates over GF(q). For this determinant, which is now called 
the Moore determinant, Moore obtained the Moore factorization 


3 


Qm(Ti,..-,Tm41) = PY P(t, .-- Ti+) 


0 


~~ 


where P;(T,,...,T141) is the monic polynomial of degree g' in Ti41 with 
coefficients in GF(q)[T1,..., 77] given by 


Pi(Ty,...,Ti41) = I] (Thar ALT? + +++ + ATi). 
(Ai,.-., ArJEGF(q)! 
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Note that 


P(Ti1)=T, and P(M%,T)= [J] (-rA1N) 
41 €GF(q) 


and more generally 


Fidget) = T] 0 Ga - ait - = WT). 
(A1,.-.,ArJEGF(q)! 


To establish the said factorization, by elementary properties of determi- 


nants, replacing the m-th row [Tm41,T415--- ae by the summation 
Docc m—1(As+1 times the ¢-th row [741,7714,-.., 1 7}) we get zero. For 
every (Ai,...,Am) € GF(q)™, for the said summation we have 
J 
SS dhl EEA? oat cd 
0<i<m-1 0<i<m-1 


and hence Qm(J1,...,Tm,A1T1 +:::+Am7m) = 0. Therefore, as polynomi- 
als in Tm41, the roots of Qm and P,, coincide. Now by induction on m we 
get the desired factorization together with the fact that Qm(71,...,7m41) is 
a nonzero polynomial of degree g™ in Ty,41 with coefficients in the polyno- 
mial ring GF(q)[T1,...,Tm] and its leading coefficient is the nonzero poly- 
nomial Qm-1(T),...;Zm). Moreover, Qm(T1,...,Tm4i) 18 symmetric in 
7T1,...,Zm41 upto sign. For instance Qn.(T1,..-, Tm41) 18 a nonzero polyno- 
mial of degree g™ in T, with coefficients in GF (q)[T2,...,Tm41] and its lead- 
ing coefficient is (—1)™*!Qm_1(T2,...,;Tm41). The inductive procedure 
used for proving the Moore factorization also shows that the value of the 
Moore determinant Qm(T,...,7m41) stays nonzero when for T,,...,7m 
we substitute any elements from an overfield of GF(q) which are linearly 
independent over GF(q). By the structure of the Moore determinant we see 
that 


(*) QmlZr. --: Im, Y) = Quay (tip --, Tm) 


yr" + Ss HiY?” 
t=1 - 


with H; = A;(T1,...,Tm) € GF(q)[Ti,.-., Im]. By the Moore factorization 
we see that T),...,T are algebraic over kp(H1,...,Hm), and hence by a 
transcendence degree argument we conclude that the elements H;,...,Hm 
are algebraically independent over kp. Therefore, in view of the above 
equation (*), we may identify (X),...,Xm) with (Hj,...,Hm) and E(Y) 
with Qm(Ti\« -., Fins FV Beet as. , Tm). Now, by the Moore factoriza- 
tion, the roots of E(Y) constitute the GF(q)-vector-space V generated by 
T1,...,Tm, and hence k,(T1,...,Tm)=SF(E, K) with K =kp(X1,..., Xm), 
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d we may identify (X1,1,+--;Xm,1) with (T1,...,Zm) in view of (3.1) 
ed (3.2). Clearly every g in GL(V) gives rise to a k,-automorphism of 
kp(T1» _..;Tm) which leaves the symmetric functions H; = Xj,...,Hm = 
Xm unchanged and hence belong to Gal(E, Kk). Therefore Gal(£, K) 
eV) = GL(m,q) which proves Moore’s Theorem. As a consequence of 
the Moore factorization, we also get another proof of the partial converse of 
(1.1) noted in (3.9) to the effect that if HE’ = E’(Y) is any monic polynomial 
of degree g™ in Y with coefficients in 2 such that the set of all roots of E’ in 
Q is an m-dimensional GF(q)-vector subspace V’ of 2 then E” is a vectorial 
g-polynomial of q-degree m in Y over 2: namely, upon letting z1,...,2m 
to be a basis of V’, by the Moore factorization we see that the elements of 
V’ are the roots of Qm = Qm(21,.--,2m,¥Y) regarded as a polynomial in 
Y over GF(q)(z1,.--, 2m); now the degree of Qm in Y is q™ and its lead- 
ing coefficient is Qm—1(Z1,---;2m) # 0, and hence the given polynomial 
must be a constant multiple of Qm; by (*) it is clear that Qm is a vectorial 
g-polynomial of g-degree m in Y, and hence so is the given polynomial. 


Note (3.37) [Question on Generalized Iteration]. In the generic 
case and assuming GF(q) C kp, in Theorem (3.25) we generalized Moore’s 
Theorem by showing that, for any integer n > 0, the Galois group of the n-th 
iterate El"! of the vectorial g-polynomial E of g-degree m is the generalized 
general linear group GL(m,gq,n) consisting of all m by m matrices with 
invertible determinant over the local ring GF(q)[7]/T”. For m = 1 this 
was proved by Carlitz [Car] in 1938 as part of his explicit class field theory 
over finite fields; the case of m = 1 was further enhanced by Drinfeld [Dri] 
in 1974; for a survey of the relevant work of Carlitz and Drinfeld see Goss 
[Gos] and Hayes [Hay]. Actually, referring to (3.30) and (3.34), Carlitz 
[Car] showed that if m = 1 and s = s(T) € GF(q)[T] with s(Xm) # 0 then 
Gal(E“), kK) = GL(V')) with EU) and Vl) as defined in (3.30), and we 


may ask if this continues to hold also for m > 1. 


Section 4: Symplectic Groups 

In this Section, to write down families of polynomials whose Galois groups 
are between Sp(2m,q) and ['Sp(2m,q), let Y,X,S,T),T2,... be indetermi- 
nates over k, and, as in Section 2, for every e > 0 let 

Rig he Ti xs Fe). 

We shall apply the considerations of Section 1 by taking K = K, or Ke(S) 
with suitable e. 

First, for 0 < e < m—1, consider the monic separable projective q- 
Polynomial 

fl = plev) y= 4 gr) 4 gro) xy (m-1) 

4 sy (Se yeas 4 grimy im-1-9 ) 


i=l 
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of q-prodegree 2m in Y over K.(S) where r = (r(0),...,r(2m)) is a sequence 
of nonnegative integers with 


(4.1*) r(2m) = 0 

such that for some nonnegative integer t we have 

(4.1**) gir(m—i)=r(mti)ttg™(i-1) forO<i<m. 
Next, for 0 < e < m—1, let 


Scene | 


Bae =A yar e oe Os om xy 


€ 
+5° pein tye" ere Sd adie | 


a 
and 
= (Y) =Yol(y) =¥?" + sry 4 srm™) xy” 


> (Seer ye een y i) 


a= 1 


be the subvectorial and vectorial associates of f! respectively, and for every 
divisor d of q—1, let f$”, o6”, \” be obtained by substituting S? for S in 
f', o4, ¢! respectively, i.e., let 


fi = fLP(Y) —y (2m-1) + gr (0)d + gr(m)d yy{m-1) 


als 3 (ornare vise ni A a 


and 
go?) = = gh%(Y )= =yYV"-1 4 gr(0)d 4 gr(m)dyya™-l 


: > (pore ye Ze ones) 


i=l 
and 
g{) = gv) ayn as gr(O)dy me grim)d yya™ 


€ 
+s (ompeye” Hari lenye” }, 


t=1 
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Finally, for 0 <e< m-1, let fe, de, Oe be obtained by putting S = 1 in 
fi, dt, gt respectively, 1e., let 


f. = ey) =i {2m—1) ei se xyyim-}) 


ao 3 jaye + Ty m-1-4)) 


at 
and 
be =oe(¥Y) HY? +14 XY"! 
and 
be = o-(Y)=¥2” +¥ 4X" 
2 yD (aye &s Tye 
Note that 
(4.1") [ if r(i) = (2m - 1) — (i= 1) for 0< i < 2m 
| then conditions (4.1*) and (4.1**) are satisfied with t = q™ +1 
and 


(4.1) { if r(m +i) = 0 and r(m—i) = q™"*(i- 1) for0 <i<m 
then conditions (4.1*) and (4.1**) are satisfied with ¢ = 1. 


Case (4.1’) arises when we homogenize fe, i.e., when we put 

fi(Y) = gl2m-1) ) f.(Y/S), and so we may call it the homogeneous case. 

By analogy, case (4.1”) may be called the twisted homogeneous case. 
Recall that a bivariate ~*(Y,Z) in K[Y, Z] is said to be a bivectorial 

q-polynomial of g-bidegree (m*,m’) (where m* > 0 and m’ > 0 are 

integers) in (Y, Z) over K if it is of the form 


* ; 


(4.2) yy, 2) = S- > aye Ze with a}; € K for all z,j 


and a*.., # 0 # a%,,,, for some j’,i’; note that then, for every overfield 
L of K Gath GF(q) C L and every GF(q)-vector-subspace V* of DL, by 
Sending every (y,z) € V* x V* to w"(y,z) we get a map y" : V*xV* oD 
Which is bilinear (over GF(q)). Likewise, a bivariate ~*(Y, Z) in K[Y, Z] is 
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said to be a symplectic g-polynomial of g-symdegree (m*,n*) (where 
m”™ > n* > 0 are integers) in (Y, Z) over K if it is of the form 


m* m , F a*.- ary € K for all 2 J 
4.3) ¥"(¥,Z) = eye gF with (gM | 
(4.3) v"( ) 220" J and a?,,, = 0 for all 7’ 


and arene FO F Gyene and ay.,, = 0 = aj,,,. for all 7’ # n*; note 
that then ~%*(Y,Z) is a bivectorial polynomial of q-bidegree (m*,m”*) in 
(Y,Z) over K, and the above map vy" : V* x V* — L is alternating and 
antisymmetric. [Recall that for vector spaces V and W over any field, 
a map 6: V x V — W is alternating means 6(z,x) = 0 for all x in 
V, and antisymmetric means 6(z,y) = —6(y,z) for all y,z in V. Note 
that if @ is biadditive then for all y,z in V we have (y+ z,y+z) = 
O(y,y) + O(y, z) + A(z, y) + A(z, z) and hence if @ is also alternating then 
6(z,y) = —@(y, z) and therefore # is antisymmetric.] 
Recall that, given any vectorial g-polynomial 


2m 


(4.4) o(Y)= Soa?" with a; € K and ap #0 # am 
t=0 


of q-degree 2m in Y over K, the vectorial derivative ~(Y,Z) of ¢(Y) is 
defined by putting 


(4.5) v(Y, Z) = ¥1" 4(Z) — 27 o(Y) 


and observe that then 


(4.6) w(Y, Z) is a symplectic g-polynomial 


of g-symdegree (2m, m) in (Y, Z) over K. 


For a moment assume that ¢(Y) is separable, and let V be the set of all 
roots of ¢(Y) in Q, and note that then V is a 2m dimensional GF(q)-vector- 
subspace of Q. Let I'(Y,2Z) be a symplectic g-polynomial of g-symdegree 
(2m —1,m-— 1) in (Y,Z) over K. Then for every 0 # y € V, clearly there 
exists z € V with I'(y,z) # 0. Now assume that 7(Y,Z) = aI (Y,Z)!? — 
P(Y,Z) withO#aéK. Ifa@=1 then let @ = 1 and ifa #1 then let 3 = 
T(y’,z’)7! for some y’, z’ in V with I'(y’, z’) # 0. Note that in both the cases 
we have 6 € K(V) with 62-1 =a. For all y,z in V let b(y,z) = @T(y,z). 
Then for all y,z in V we have b(y, z)? — 6(y,z) = G[aI(y, z)? —T(y,z)] = 
Bv(y,z) = 0 and hence b(y,z) € GF(q). Therefore b: V x V — GF(q) 
is symplectic. [Recall that for a vector space V over any field, a map b 
from V x V to the field is symplectic if it is bilinear, alternating, and 
nondegenerate, where nondegenerate means that for every 0 # y € V we 
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have b(y, z) # 0 for some z € V. For any such symplectic map: a similitude 
of 6 is an element g of GL(V) for which there is a nonzero element g* of 
the field such that for all y,z in V we have b(g(y),g(z)) = g*b(y,z); an 
isometry of 6 is a similitude g of 6 with g* = 1; a semisimilitude of 6 
‘s an element g of L(V) for which there is a nonzero element g* of the 
field and an automorphism g’ of the field such that for all y, z in V we have 
b(g(v), 9(2)) = 979'((y, 2); in case the field is GF() and 6 is a divisor of 
u, the semisimilitude g is said to be of type 6 if g’ € Gal(GF(q), GF (p"/5). 
for generalities about symplectic maps see the book {KLi], especially (2.1.7) 
and 2.1.2 on pages 11 and 12 of that book.] Given any g € Gal(¢,K) = 
Gal(K(V), K), upon letting g* = 6/9(3) we have 0 # g* € GF(q) because 
oF B9-! =a € K (also if a = 1 then obviously g* = 1), and upon letting 
g'(¢) = 9(¢) for every ¢ € GF(q) we get g’ € Gal(GF(q), GF(p"/"\*))) 
such that for all y,z in V we have g(b(y.z)) = g'(b(y,z)) and hence we 
have 6(g(¥),9(2)) = ST (g(y), 9(2)) = 9" 9( BP (ay), 9(2)) = 9" 9 (BT, z)) = 
g*g9(b(y,2)) = gg‘ (b(y, z)). Thus we have proved the following refinement 
of (4.7) of [AL2]: 


Semisimilitude Lemma (4.7). Let f = f(Y) be a separable projective q- 
polynomial of q-prodegree 2m in Y over K, let@?= o(Y) = f(¥3~') andgd = 
#Y) = Y¢(Y), let ¥(Y,Z) € K[Y, Z] be the vectorial derivative of WY), 
let V be the set of all roots of d(Y) in an algebraic closure Q of K, and note 
that then V is a 2m dimensional GF(q)-vector-subspace of 2. Assume that 
there exists a symplectic qg-polynomialT'(Y, Z) of q-symdegree (2m—1,m—1) 
wn (Y,Z) over K such that (4.7*) ¥(Y,Z) = al (Y, Z)4 —T(Y,Z) with OF 
a@EK. Then for every0 #y € V there exists z € L with T(y,z) #0. Ff 
a@=1 then let 8 = 1 and ifa #1 then let B =T(y’,z')7! for some y’, z' 
in V with T(y’,z’) # 0. Note that in both cases we have @ € K(V) with 
BI-* =a. For all y,z in V let b(y,z) = BT (y,z). Thenb: V x V — GF(q) 
ts symplectic, and in a natural manner we have Gal(¢, K) < TSp(2m,q) = 
the semisimilitude group of b, so that for each g € Gall¢, K) there exists a 
unique nonzero element g* in GF(q) and a unique automorphism g' of GF(q) 
such that for all y,z in V we have b(g(y),9(z)) = g*g'(b(y, z)). Moreover, 
for every g € Galo, K) we have g’ € Gall GF(q), GF(p"/*))), and ifa = 1 
then for every g € Gal(¢,K) we also have g* = 1. Thus, in particular, of 
GF(q) C K then Gal(¢,K) < GSp(2m,q) = the simihtude group of 6, 
and if GF(q) C K anda =1 then Galo, K) < Sp(2m,q) = the tsometry 
group of b. Finally note that the Galows group Galo, K) essentially equals 
the Galois group Gad, K) except that the former acts on nonzero vectors 
while the latter acts on the enttre vector space V. 
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In the situation of (4.7), we clearly have: 


 Gal(¢, NK) < TSp5x)(2m, q) 
(4.7.1) ¢ and GSp(2m, ¢) <TSp5¢4)(2m, gq) «<TSp(2m, q) 
with PSpocic)(2m, g)/GSp(2m, a) = Zscc) 


where, for any factor 6 of u, by TSps(2m,q) we denote the group of all 
semisimilitudes of 6 of type 6. Note that, in the above notation, g + log) 
and g ++ g’ give canonical epimorphisms p : [Sp(2m,q) — T'L(1,q) and 
kK: TSp(2m,q) — Aut(GF(q)) with kernels Sp(2m,q) and GSp(2m, q) re- 
spectively, and the inverse image of Gal(GF(q), GF(p"/°)) under the second 
epimorphism « is the group [Sps(2m,q); the group [Sps(2m,q) can also 
be characterized as the unique normal subgroup of TSp(2m,q) such that 
GSp(2m, q) <TSps(2m, g) with [Sps(2m, ¢)/GSp(2m, q) = Zs; the image of 
PSps(2m,q) under the canonical epimorphism [Sp(2m,q¢) — PI'Sp(2m, ¢) 
is denoted by PISps(2m,q). By restricting « to GSp(2m,q) we get the 
canonical epimorphism GSp(2m,q) — GF(q)* = GF(q) \ {0} with ker- 
nel Sp(2m,q) = p(GSp(2m,q)) (see the lines between (5.1) and (5.2) of 
[A04]), and hence, for any divisor d of g — 1, there is a unique normal sub- 
group GSp\4) (2m, q) of GSp(2m, q) such that Sp(2m, q)aGSp" (2m, q) with 
GSp(2m, g)/GSp‘® (2m, q) = Za; the image PGSp‘® (2m, q) of GSp‘”)(2m, q) 
= Zq under the canonical epimorphism GSp(2m,q) — PGSp(2m, q) can be 
characterized as the unique group between PSp(2m, ¢) — PGSp(2m, ¢) such 
that PGSp)(2m,q) = PSp(2m,q) or PGSp(2m,q) according as d is even 
or odd; this follows from the fact that PGSp(2m,q)/PSp(2m,q) = Z> or 
Z, according as q is odd or even; note that if g is even then PSp(2m,q) = 
PGSp(2m, q); see 2.1.2, 2.1.B, 2.1.C and 2.1.D of [KLi]. In the situation of 
(4.7), by the usual Galois correspondence we also see that: 


{ Gal(¢, K) N GSp(2m, q) = Gal(¢, K(GF(q))) «Gal(¢, K) 


(4.7.2) | with Gal(¢, K)/Gal(¢, K(GF(q))) = Zscx). 


As an immediate consequence of (4.7.1) and (4.7.2) we see that, in the 
situation of (4.7): 


(if Gal(¢, K(GF(q))) = GSp™ (2m, q) 
where d is a divisor of g — 1 
(4.7.3) | then we nave 
Gal(¢, K) N GSp(2m, q) 
= GSp) (2m, q) « Gal(¢, K) < TSps¢~)(2m, q) 


. with Gal(¢, K)/GSp'?)(2m, q) = Zsx). 


Since GSp)(2m, q) =GSp(2m, g) and TSps«)(2m, q)/GSp(2m, q) = Z5(K); 
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by taking d= 1 in (4.7.3) we see that, in the situation of (4.7): 


(4.7.4) 
if Gal(¢, K(GF(q))) = GSp(2m, q) 


2? then we have 


Gal(¢, K) = PSps;%)(2m,q) and Gal(f, A) = P©Spsx)(2m, q). 


The above definitions of the various groups between Sp(2m,q) and 
[Sp(2m,q), or between PSp(2m,q) and PISp(2m,q), as well as the def- 
initions of the canonical epimorphisms p : TSp(2m,q) — TL(1,q) and 
« : PSp(2m,q) — Aut(GF(q)) with kernels Sp(2m,q) and GSp(2m, q) re- 
spectively, apply for any symplectic map b : V x V — GF(q) where V is 
any 2m dimensional vector space over GF(q). Moreover, given any such 
symplectic map, for any divisor 6 of u, let I’Sps(2m,q) be the set of all 
subgroups 7 of [Sps(2m,q) such that IM GSp(2m, q) = Sp(2m,q) «TJ with 
I/Sp(2m,q) = Zs, and let PI’Sps(2m, q) be the set of images of the various 
members of ['Sps(2m,q) under the canonical epimorphism of ['Sp(2m, q) 
onto PI'Sp(2m,q); in Remark (4.21) we shall show that I’Sps(2m,q) is a 
nonempty complete set of conjugate subgroups of [Sp(2m, q), and every 7 in 
I’Sps(2m, ¢) is a split extension of Sp(2m, ¢) such that [Sp5(2m, q) is gener- 
ated by Sp(2m, ¢) and J. Likewise, for any divisor d of g—1, let rsp\? (2m, q) 
be the set of all subgroups J of TSps(2m,q) such that J MN GSp(2m,q) = 
GSp( (2m, q) <a J with J/GSp\ (2m, q) = Zs; and I < J for some 7 in 
I’Sps(2m,q), and let PLSp\” (2m, g) be the set of images of the various 
members of Sp” (2m, q) under the canonical epimorphism of [Sp(2m, q¢) 


onto PI’Sp(2m, q); in Remark (4.21) we shall show that Sp” (2m, q) is a 
nonempty complete set of conjugate subgroups of [Sp(2m,q), and every J 


in Spi? (2m, q) is a split extension of GSp( (2m, q) such that TSp5(2m, q) 
is generated by GSp(2m,q) and J; note that clearly TSp\?-?(am, q) = 
I’Sps(2m,q) and rSp\)(2m,q) = {[Sps5(2m, q)}. 


Since GSp(¢-)(2m, q) = Sp(2m, q), by taking d = q¢—1 in (4.7.3) we see 
that, in the situation of (4.7): 


then we have 


| if Gal(¢, K(GF(q))}) = Sp(2m, q) 


(4.7.5) < 
Gal(¢, K) € I’Spsx)(2m, q) 


_ and Gal(f, K) € PI’Sps(x)(2m, q). 


Finally, because of the usual Galois correspondence, by (4.7.4) and (4.7.5) 
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we see that, in the situation of (4.7): 


( if Gal(¢, K(GF(q))) = GSp'” (2m, q) 
where d is a factor of g — 1 and 
for some field K’ between AK and SF(¢, K) we have 
(4.7.6) { 6(K') = 6(K) and Gal(¢, K’(GF(q))) = Sp(2m, q) 
then we have 


Gal(¢, K) € TSpi/}.,(2m,q) 
_ and Gal(f, A) € PI'Spicx (2m, 4). 


As an example of a vectorial derivative, for 0 < e < m—1, in (3.8) of 
[AL 1] we considered the bivariate polynomial 


(4.8) wh(Y,Z)=Y¥2" oh(Z) — ZT oY) 


and, by applying the Mantra of [A05], in (3.9) of [AL1] we obtained the 
factorization 


(4.9) PLY, Z) = S-* TU, Z)* = Ty, Z) 
where, as noted in (3.10) of [AL1], we have 


1-1 


= 3 = (ze" i lho ee ye Sept Te 


Again, as noted in (3.11) of [AL]], 


‘ for 0 <e < m-—1 we have that 


Tl is a polynomial of degree g?"~! in Z 
with coefficients in GF(p)[Y, S,T,,..., 7] and in it 


. the coefficient of the highest Z-degree term is (vst 


(4.11) 6 


Finally, as noted in (3.12) of [AL1] and (4.12) of [AL2], as a consequence 
of (4.8), (4.9) and (4.11) we have the following: 


Root Extraction Theorem (4.12). For0 <e< m-—1, there exists a 
nonzero root ye of d4(Y) i any splitting field Lt of o4(Y) over K.(S), 
and given any such y. there exists a root z. of d4(Y) in Lt such that 
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Th (yes Ze) #0, and for every such z- we have TE(ye,z-)4~' = S” (note 
Ral for any Ye € Lt and z. € L' we obviously have T¥(y.,z-) € Lt). More- 
over, for every divisor d of (q - 1)/GCD(t,q —1), there exist integers o,T 
with otg™ + Tq - 1) = (q—1)/d, and if GF(q) C kp then, given any such 
roots Yes Ze and any such integers 0,7, there exists X € GF(q) C kp such 
that for Ae = AP! (ye,Ze)°S7™ we have Ae € Lh with A? = S, and in a 
natural manner we have Gal(ge, Ke) < Galax", Ke(S)) « Galt, Ke(S)) 
Gal(g!, Ke(S))/ Gal ge”, Ke(S)) = Za. 

As another example of a vectorial derivative, for 0 < e < m-— 1, we 
consider the bivectorial polynomial 


(4.13) ve(¥,Z) =¥" $.(Z) — 22" $.(Y) 


and we note that, upon letting [.(Y,Z) be obtained by putting S = 1 in 
Ti(y, Z), by (4.9) and (4.10) we get the factorization 


(4.14) Pe(Y,Z) =T-(Y, 2)? —T.(Y,Z) 


where 


(4.15) 


Also we note that, by (4.11), 


‘ for 0 <e < m-—1 we have that 
Tr, is a polynomial of degree q?”~! in Z 
with coefficients in GF(p)[Y,71,...,7.] and in it 
. the coefficient of the highest 7-degree term is yor, 


(4.16) P 


Now, as hinted above, for 0 < e < m—1, by (4.8) and (4.13) we see that 
the bivariate polynomials wi(Y, Z) and v.(Y, Z) are the vectorial derivatives 
of the monic separable vectorial g-polynomials oh(Y) and 4.(Y) of g-degree 
2m in Y over K.(S) and K, respectively, and by (4.10) and (4.15) we see 
that FEY, Z) and Fucy, Z) are symplectic g-polynomials of g-symdegree 
(2m —1,m-—1) in (Y, Z) over K.(S) and K, respectively, and by (4.9) and 
(4.14) we see that condition (4.7*) is satisfied with a = S~" anda = 1 
Tespectively. Therefore by (4.7) we get the following restatement of (4.17) 
of [AL9] 
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Theorem (4.17). If GF(q) C kp then, for0 <e < m~1, in a natural man, 
ner we have Gaol, Ke(S)) < GSp(2m,q) and Gal(o., Ke) < Sp(2m,q). 


By various factorization arguments given in [A04], [AL1] and [AL2], ip 
(3.6) of [AL2] we concluded with the following: 


Supplemented Symplectic Rank Theorem (4.18). For1 <e < m-~ 
the groups Gal f?, Ke(S)) and Gal f., Ke) are transitive permutation groups 
of Rank 3 with subdegrees 1, q(2m — 3) and q?™—!, 1.€., each of them is g 
transitive permutation group whose one-point stabilizer has orbits of sizes 
1, g(2m — 3) and q?"-!. Moreover, ifm =2=q then |Gal( fi, Ki(S))| =0 
(mod 6!) and |Gal f;, K,)| = 0 (mod 6!). 


Recall that an antiflag in a finite dimensional vector space is a pair of 
a vector and a one-codimensional subspace not containing it. A group of 
linear transformations is said to be antiflag transitive if it transitively 
permutes all antiflags. Concerning such groups, in Theorem II of [Cak], or 
in greater detail in Proposition (6.2) of [CaK], the following result is proved: 


Theorem (4.19*) [Cameron-Kantor]. Let G < GSp(2m,q) be antiflag 
transitive and, upon letting Oa, be the natural epimorphism of GL(2m, q) 
onto PGL(2m,q), assume that Oom(G) ts a primitive Rank 3 permutation 
group, and also assume that tn case of m = 2 = q the group Oom(G) 1s 
not tsomorphic to the alternating group Ag. Then Sp(2m,q) < G. [Note 
that if G < GSp(2m,q) ts such that Oom(G) ts a transitive permutation 
group of Rank 3 then G is automatically antiflag transitive; the redundancy 
in our hypothesis is there because we are only stating a part of Theorem 
IT of [CaK]; also note that we shall use this theorem in conjunction with 
(4.18), and then the primitivity assumption can be dispensed with, because 
a transitive permutation group of Rank 3 with subdegrees 1, g(2m— 3) and 


gq?! is automatically primitive. ] 


As asserted in (5.2) and (5.3) of [AL2], as a consequence of (4.7), (4.12), 
(4.17), (4.18) and (4.19*), we have deduced the following Theorem. 


Theorem (4.19). Forl1<e<m-—1 we have the following. 

(4.19.1) If GF(q) C kp then in a natural manner we have Gal(¢., Ke) = 
Sp(2m,q) and Gal(f., K.) = PSp(2m,q). 

(4.19.2) If GF(q) C kp and GCD(t,qg—1) = 1 then in a natural manner 
we have Gal(ol, K.(S)) = GSp(2m,q) and Gal f!, K.(S)) = PGSp(2m, q). 

(4.19.3) If GF(q) C kp and GCD(t,q—1) = 1 then, for every divisor d of 
q—1, in a natural manner we have Gal (9) K.(S)) = GSp\)(2m,q) and 


Gal fi”, K.(S)) = PGSp(2m, q). 

For a moment, given any integer e with 1 < e < m—1, either let 6 = 6, 
and f = fe and K = Ke, or assuming GCD(t,qg — 1) = 1 let ¢ = of and 
f = fland K = K,(S), or assuming GCD(t,q — 1) = 1 and given any divi- 
sor dof qg—llet¢g= gi? and f = fi? and K = K,.(S). Now upon letting 
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= kp(GF(q)), by (1.1) we know that k” C SF(¢, 4), and for any finite 
algebraic field extension k’ of k* in SF(@, A’) we obviously have [K(k’) : 
K(k") = (k’ : k*], and upon letting 6 = 6(k,), we obviously have 6 = 6(K). 
In case of @ = ge, by (4.19.1) we have Gal(¢, K(k’)) = Sp(2m,q) = 
Gal(¢, K(k")) and Gal(f, K(k’)) = PSp(2m, q) = Gal(f, A(k*)), and hence 
by (1.6) we get AC(kp,¢@,K) = k*, and by (4.7.5) we get Gal(¢,K) € 
T’Sps(2m, 9) and Gal(f, A) € PI’Sps(2m,q). Similarly, in case of ¢ = ¢b, 
by (4.19.2) we have Gal(¢, K(k’)) = GSp(2m,q) = Gal(¢, K(k")) and 
Gal(f, K(k’)) = PGSp(2m,q) = Gal(f, A(4")), and hence by (1.6) we get 
AC(kp,¢,K) = k*, and by (4.7.4) we get Gal(g, AK) € TSps(2m,q) and 
Gal(f,K) € PYSps(2m,q). Finally, in case of @ = gi?) by (4.19.3) we 
have Gal(¢, K(k’)) = GSp (2m, q) = Gal(@, K(k*)) and Gal(f, K(k’)) = 
PGSp( (2m, q) = Gal(f, K(k*)), and hence by (1.6) we get AC(kp, ¢, K) = 
k*, and by (4.12) we can find A € SF(¢,A) with AG-1/d = §- now 
upon letting A’ = K(A) we obviously have 6(A’) = 6(4), and by the 
usual Galois correspondence we see that Gal(¢, A’(k*)) a Gal(@, K(k*)) 
with Gal(¢, K(k*))/Gal(¢, K’(k*)) = Gal(A"(k"), K(k*)) = Z(g-1)/a} since 
p(GSp'9) (2m, q)) = Sp(2m,q) with GSp( (2m, q))/Sp(2m, 4) = Zq—r)/a 
we must have Gal(¢, K’'(k*)) = Sp(2m, q); hence Gal(@, K) € PSp\7) (2m, q) 
by (4.7.6) and therefore Gal(f, A) € rPSp\” (2m, q). Thus we have proved 
the following Theorem. 


Theorem (4.20). For1<e<m-—1 we have the following. 

(4.20.1) We have AC(ky,¢e, Ke) = ky(GF(q)) and in a natural man- 
ner we have Gallde, Ke(GF(q))) = Sp(2m,q) and Gal(f., K.(GF(q))) = 
PSp(2m,q), and upon letting 0 = O(Ks) we have 6 = d(K,.)) and 
ina natural manner we have Gal(d., Ke) € I’ Sps(2m,q) and Gall f., Ke) € 
PY’ Sp5{2m, q). 

(4.20.2) If GCD(t,q—1) = 1 then we have AC(kp, 4, K.(S)) =kp(GF(q)) 
and in a natural manner we have Gal(¢, K.(GF(q))(S)) = GSp(2m,q) and 
Gal ft, K.(GF(q))(S)) = PGSp(2m, q), and upon letting 6 = 5(kp) we have 
6 = 6(K.(S)) and in a natural manner we have Gal(o4, K.) = TSp5(2m, q) 
and Gal fi, K.) = PU'Sps(2m,q). 

(4.20.3) If GCD(t,q — 1) = 1 then, for every divisor d of q —1, we 
have AC(kp, 00”), Ke(S)) = ky(GF(q)) and in a natural manner we have 
Galo, K.(GF(q))(S)) = GSp(2m,q) and Gall f6?, K.(GF(q))(S)) = 
PGSp) (2m, q), and upon letting 6 = 6(k,) we have 6 = 6(K,(S)) and ma 
natural manner we have Galo, Ke) € Psp\? (2m, q) and Gall fS, Ke) = 
CPSp\ (2m, q). 


Remark (4.21) [Symplectic Semilinearity]. Given any symplec- 
tic map 6: VxV—<s GF(q) where V is any 2m dimensional vector 
Space over GF(q), to verify the properties of the sets I’Sps(2m,q) and 
'Sp\ (2m, q) asserted between (4.7.4) and (4.7.5) where d and 6 are any 
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divisors of g— 1 and u respectively, we first note that the canonical epimor- 
phism « : TSp(2m,q) — Aut(GF(q)) with kernel GSp(2m,q) splits; to see 
this, we take a symplectic basis (see page 24 of [KLi]) of V, ie., a basis 
(24). -- Bee, «125 Ym) of V such that b(xi, 25) = = 6(y%,y;) = 0 for all 7,7, 
and ats = = lor 0 according asi = j ori # j, and as on page 79 of [A02] 
we identify [Sp(2m, q) with {(g,a) : g € Aut(GF(q)) and a € GSp(2m, q)}, 
and we note that then « is given by (g,a@) + g, and hence upon letting 
I’(2m,q) = {(g,1) : g € Aut(GF(q))} we see that I’(2m,q) is a sub- 
group of [Sp(2m,q) which is mapped isomorphically onto Aut(GF(q)) b 
Kk. Let [P'5(2m,q) be the unique subgroup of I’(2m,q) of order 6. Then 
clearly ['Sps(2m, q) is generated by [',(2m,q) and GSp(2m,q). Upon let- 
ting 1;(2m,q) be the subgroup of [Sps(2m,q) generated by I',(2m,q) and 
Sp(2m,q) we see that [{(2m,q) € I’Sps(2m,q). Likewise, upon letting 
TOM, q) be the subgroup of ['Sps(2m,q) generated by I(2m,q) and 
GSp'®(2m,q) we see that I/(2m,q) < I'D (2m, q) € rsp\?(2m, q). Ob- 
viously « factors through the canonical epimorphism p : TSp(2m,q) — 
PL(1,¢) with kernel Sp(2m, q). 


Let GL(1,), PLs(1,q), PSLs(1, ¢), PLS(1, 9), P(1,4), Pa(1, 9), Is(1.9) 
and Js5(1,q) be as in (3.29). Now p induces the usual bijection between the 
set of subgroups of [Sp(2m, q) containing aaa q) and the set of all sub- 


groups of ['L(1,q). In particular we have rsp\* (2m, q) = {J < TSp(2m,q) : 
plJ)€ TL" (1,9) q)}. Also we have p(I’Sps(2m,q)) = T'Lg(1,q) and we have 
o(T'Sp\4 (2m, HLS rh (1, q). Now it is easy to see that T'SL5(1,q) is the 
set of all conjugates of Is(1,q) in TL(1,q¢), and ri (1,9) is the set of all 
conjugates of JA, q) nTL(1,q). {From this it follows that I[’Sp5(2m, q) is 
the set of all conjugates of J{(2m,q) in TSp(2m, q), and rSp\? (2m, q) is the 
set of all conjugates of JD (2m, q) in TSp(2m,q). Clearly I{(2m, q) is a split 
extension of Sp(2m,q) such that [Sps(2m, q) is generated by GSp(2m, q) 
and [;(2m,q), and likewise I'am, q) is a split extension of GSp(® (2m, q) 
such that ['Sps(2m,q) is generated by GSp(2m,q) and I'D (2m, q). 
Therefore T’Sps(2m,q) is a nonempty complete set of conjugate sub- 


groups of VSp(2m,q), and every I in I’ Sps(2m,q) ts a split extension of 


Sp(2m,q) such that TSps(2m,q) is generated by GSp(2m,q) and I, and in 
d) 


the same manner l'Sp; (2m,q) ts a nonempty complete set of conjugate 


subgroups of VSp(2m,q), and every J in l'Sp\? (2m, q) 1s a split extension 
of GSp\4) (2m, q) such that T'Sps(2m,q) 1s generated by GSp(2m,q) and J. 


Remark (4.22) [Symplectic Similitude]. Referring to (4.20.1) and 
assuming GF(q) C kp, for 1 = e < m—1 we have the vectorial quintinomial 


d= YO™4TIyI" 4 XVI" 47, YI" 4Y which, in [A04], was our start- 
ing point of getting an unramified covering of the affine line over kp(T;) hav- 
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ing Sp(2m, 4) as Galois group. Referring to (4.20.2) and assuming GF(q) C 
kp, in [ALL and [AL2] ee ge to the vectorial quintinomial 

' sy! + eer ay a Sexy = srim- OT" 4 gry 
with GCD(t,q—1) = 1 giving an unramified covering of the once punctured 
affine line over kp(X,71) having GSp(2m, q) as Galois group. Referring to 
(4.20.1) but without assuming GF(q) e kp, in the excluded case of e = 0 
we get the trinomial ¢ = Y?" + XY*" +Y which can be obtained by 
substituting (2,q”) for (m,q) in the trinomial @5 mentioned in (2.4.1) and 
hence by (2.2.1) and (2.3.1) we see that: AC(kp, 60, Ko) = kp(GF(q)) and 
Gal(¢o, Ko(GF(q))) = SL(2,¢) = Sp(2,q), and upon letting 6 = 6(k,) we 
have 6 = 6(Ko) and Gal(¢o, Ko) € PSL(2,q) = PSp(2, q). 


m—1 


Note (4.23) [Division Points]. It may be noted that most of our 
equations are universal in the sense that the only coefficients involved are 
0,1,-1. Hence it may be possible to “lft” some of them to characteristic 
zero without changing their Galois groups. In particular, in this manner, 
there should result nice explicit equations over the rational numbers having 
GSp(2m,q) as Galois group. Some of these may very well match (coin- 
cide) with some of the coverings which Serre [Se2] has recently obtained 
by adjoining the coordinates of division points of abelian varieties. Indeed, 
this unpublished work of Serre [Se2] was the motivation which led us to 
deform our original Sp equations into GSp equations. The origins of this 
work of Serre may be traced back to the 1870 book of Jordan where Jordan 
generalized Jacobi’s elliptic function calculations to hyperelliptic function 
calculations. This was the birth of symplectic groups and explains why 
Jordan called them linear abelian groups in honor of Abel, who generalized 
hyperelliptic functions to abelian functions which are inverses of algebraic 
integrals. In his 1901 book [Dic] Dickson continued to follow Jordan’s ter- 
minology. It was almost seventy years later that Weyl [Wey] renamed them 
symplectic groups. Jacobi’s elliptic function calculations were brought to 
fruition by Serre in his 1972 paper [Sel]; see Theorem 19.1 on page 366 of 
Silverman’s book [Sil] for a statement of Serre’s result; this book of Silver- 
man also outlines a procedure for obtaining equations of fields attached to 
division points of elliptic curves. Likewise, Jordan’s hyperelliptic function 
calculations were greatly refined in Serre’s 1985 unpublished work [Se2] 
which is the higher dimensional extension of his 1972 paper [Sel]. Now 
Drinfeld’s work [Dri] on “Drinfeld Modules” seems to have been inspired by 
Serre’s work [Sel] on division points of elliptic curves, eventually extended 
by him [Se2] to division points of abelian varieties. In turn, our description 
of the module El! in (3.30) is based on the ideas of Drinfeld Modules. For 
a discussion of Drinfeld Modules and their relationship with division points 
of elliptic curves and abelian varieties see Goss [Gos]. Very briefly, the roots 
of the separable vectorial g-polynomial E of g-degree 2m exhibited in (3.1) 
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form a 2m dimensional GF(q)-vector-space on which the Galois group of 
E acts. The said Galois group also acts on the roots of Els! discussed in 
(3.30) which are the analogues of “s-division points of FE.” Indeed, we have 
used the letter # to remind ourselves of elliptic curves in case of m = ] 
and more generally of 2m dimensional abelian varieties. Turning the table 
around, starting with vectorial polynomials, we may visualize an m dimen- 
sional abelian variety as a characteristic zero house to be built around a 
2m dimensional GF(p)-vector space. This will match with the fact that 
the p-division points of an m dimensional! abelian variety over a field of 
characteristic different from p do form such a vector space. 
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Tools for the computation of families of 
coverings 


Jean-Marc Couveignes* 


March 5, 1999 


Abstract 


We list several techniques for efficient computation of families of 
coverings and we illutrate them on an example. 


1 Introduction 


The computation of algebraic models for coverings of the line is interesting 
both for theoretical reasons (e.g. the inverse Galois problem) and computa- 
tional ones (as a test example for computer algebra tools). In many cases 
one reduces to solving a zero dimensional algebraic system (see [22, 2, 21] 
for many examples). This can be achieved using Buchberger algorithm. For 
many reasons, however, one would like to avoid using such an expensive al- 
gorithm from the point of view of complexity. In particular, the algebraic 
system one can associate to a covering does not provide a very sharp char- 
acterization and usually admits many solutions having nothing to do with 
the initial problem. Indeed, such a system may easily encode multiplicities 
but certainly not such discrete invariants as the monodromy group. On the 
other hand, famous work by Atkin and Swinnerton-Dyer achieves quite non- 
trivial computations using methods from numerical analysis [1] and similar 
methods were applied succesfully in different contexts. Some time ago Ralph 
Dentzer asked about how to compute an algebraic model for a covering of the 
sphere ramified above four points with monodromy group Moq4 given in [20]. 
This computation was achieved by Granboulan in [14] with a lot of numerical 
methods. At that time I collected several tricks and constructions in order to 
help with this computation, but this was not published because of the length 
of the result itself and also because the computational challenge appeared 
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to be the most important. It seems that this information, however, may be 
of some use to other people performing similar computations. Also I have 
developed it a little bit further and I give in this work an illustration of it on 
a simple though non trivial example. It gave me an opportunity to consider 
these computations from a more conceptual point of view. In particular | 
realized the importance of explicit patching a la Harbater [15], and I tried to 
detail the algorithmic aspects of it for genus zero coverings. In this special 
case one can use the explicit description of the moduli spaces [12] to deal 
efficiently with not necessarily Galois coverings. The reason why patching is 
efficient is that it allows the computation of formal fibers without computing 
the extension of the base. The latter may really be huge in non-rigid cases 
(degree 144 in [14]). On the contrary, the extension of the basis is derived 
from the model for the fiber. 

The paper is organized as follows. In section 2 we present down to earth 
techniques for computing an algebraic model for a covering. These techniques 
were known to Fricke. We illustrate them on simple examples. In section 3 
we define the main family of coverings that will serve us as an example and 
start studying its combinatorial properties. The main point there is Hurwitz 
braid action. In section 4 and 5 we show how to compute an algebraic model 
for our family of coverings from the consideration of degenerate ones. Once a 
model has been computed for a degenerate cover, we first compute an analytic 
deformation of it which consists of a one parameter family of coverings, the 
parameter taking its values in a real interval. This is the purpose of section 
4 and does not require more than linear algebra computations. From this 
analytic family we derive an algebraic one in section 5. This again reduces 
to linear algebra. In section 6 we give a more general and more conceptual 
description of our method. It uses the explicit description of the compact- 
ification of moduli spaces of curves given in [12] for the case of genus zero 
curves. We explain in particular how to compute in advance the degree of the 
coefficients that appear in the algebraic model we are looking for. Geometri- 
cally, these degrees are expressed in terms of the “thickness” of intersections 
in some formal curves. 

It should be clear that the example we present is a toy that we chose for its 
simplicity and for such a small covering there exist simpler, faster methods. 

The methods presented here apply to any genus zero covering of the sphere 
minus r points. Computations will be more difficult for higher genera, how- 
ever, (except small values) because of the lack of a sufficiently explicit de- 
scription for the corresponding moduli spaces. 

We hope the reader will be convinced that the rich recent theory together 
with old computational methods make the computation of coverings much 
easier than it appears provided one does not rely too much on Buchberger’s 
algorithm, as useful as it is. 

I thank Helmut Vélklein for his careful reading of a first version of this 
work. I thank Louis Granboulan and Lily Khadjavi for several useful correc- 
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tions and comments. 


2 Two coverings ramified over three points 


In this section we shall compute an algebraic model for two coverings of the 
sphere minus three points. This will be useful in the next sections. We take 
this opportunity to recall how people have been efficiently computing simple 
coverings since the last century. 

Let P,(C) be the projective line over the field of complex numbers and 
let Ri, Ro, Rz be three distinct points on it. By a coordinate on Pi(C) we 
mean a generator of its function field. For P a point and z a coordinate we 
denote by z(P) the value of z at P. There is a unique coordinate z such that 
z(R,) = 0, 2(R2) = 1, z(R3) = 00. Let 3 be the point with z-coordinate 
equal to z(3) =7+1/2 and let E,, N. and U3 be the three loops represented 
on figure 1 in the plane with coordinate =z. 


1/241 


SSA 


Figure 1: 7(P; — { Ry, Re, R3}, 6) 


Let pi, P2, p3 be the three permutations below 


pio SL 29 465.6,7], 
p2 = ([i1,2], 
p3 = (p2p1)7'. 


Consider the covering of P,(C) —{ Ry, Ro, R3} with monodromy (1, p2, p3) 
in the basis (D1, U2, U3). 

The Riemann-Hurwitz formula shows that this is a genus zero covering 
that is a map f : P; -+ P, unramified outside {R,, Ro, R3}. This map f can 
be represented as a rational fraction F' provided we pick a coordinate on the 
left and a coordinate on the right. Let $, be the unique point above R, and 
S2 the unique point with multiplicity 1 above Rz and S3 the unique point 
with multiplicity 6 above R3. Let x be the coordinate which takes the values 
Od -and’6e at <S),..595.55 respectively. 


uw 
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We now can represent the covering f as arational fraction F(x) = z which 


satisfies 


x 


eh 

with Ay a constant and F(x) — 1 has a double zero i.e. Aya’ — (x — 1) has a 
double zero. We therefore solve the system {A,2'-—2+1=0,7A,2°—1 = 0} 
and trivialy find the unique solution x = 7/6 and Ay = 6°/77. We thus get 


F(a) hy 


6° ra 
wee Tl gl a) 
and 
Of a TYB eS co Det | 4953 1 24B2 » 120054 4 16807 
F(t)-1= ONE = gE ttt ht Sat + “ta96 + “296 ) 
fie r-1 


We observe that the coefficients in the expressions above are rational. This 
could have been deduced a prior? using a rigidity criterion. 

The reader who is familiar with the theory of Grothendieck’s dessins 
d’enfant as presented in [22] may like to see the dessin corresponding to 
f. It is the preimage of [0,1] by F’. In figure 2, bullets correspond to points 
above 0 and arrows to points above 1. 


Figure 2: Dessin 


We now consider a slightly more difficult example. Let 71, 72, 73 be the 
three permutations below 


T= [1], [2, 3,4], [5], [6,7], 
T2 = [1, 2,5, 6], 
mB O= (727) 7!. 


They define a genus zero covering g : P; —> Py which is unramified outside 
{Ri, R,, R3} with monodromy (7),72,73) in the basis (Hy, U2, Li3). We call 7; 
the unique point with multiplicity 3 above R, and 7) the unique point with 
multiplicity 4 above Rz and 73 the unique point above R3. We take y to be 
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the unique coordinate on P; which takes values 1, 0 and oo at 7, Tz and fe 
respectively. We also call 74 the unique point with multiplicity 2 above R,, 

The covering g may be represented as a polynomial function G(y) = , | 
which satisfies 


G(y) = Ka(y — 1)°(y — y(T4))°C(y) and G(y) — 1 = Koy‘ A(y) 


where C(y) is a degree 2 monic polynomial and A(y) is a degree 3 monic 
polynomial and A is a constant. We set C(y) = y? — ay+6and A(y) = 
y® — cy? + dy — e and y(T4) = f and we write the identity 


Koy - 1° (y — f)?(y? — ay +b) -— 1 = Koy*(y? — cy? +dy—e). (3) 


We differentiate the identity above with respect to the variable y and find 


(y-—Ff)(y-1)? (7y° +(-6a—4-5 fy? +(4 fa +2 f4+3a+5b)y—fa—2b-3 fb)=y3 (7y? —6cy? +5dy—4e) 


and since f # 0 we deduce that y® divides 7y? + (—6a — 4 — 5f)y? + (4fa + 
2f + 3a + 5b)y — fa — 2b—3fb and thus that 


—6a-—4—5f =0 and 4fa+2f+3a4+5b=0 and fa+ 264 3fb=0. 


We solve the system above and find that f is one of the three solutions of 


10f?+12f?+9f+4=0 (4) 
and 
: 5 2 ee 10 y) 
f eee a es * £2 baal a 
(yy=y or) aad Tags te 
and 
7 7 14 rf 7 
Aly) =o — (2 f 2 e)y? + ese Sy = 2 5 
(y=y (af + sly + (eft sly ie (5) 
and 
60 
ioe, 6 
I= TFG (6) 


Simple combinatorial considerations (see [5] or [6, Theorem 2] ) show that 
the field of moduli of our covering is real. There is a single real solution to 
equation 4. We therefore take f to be this real solution. This finishes the 
computation of a model for the covering g. We draw the corresponding dessin 
on figure 3. 
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TT T2 Ty 


Figure 3: Dessin 


3 Topological description 


In this section we describe a simple family of coverings branched over 4 points 
and we start studying it from the point of view of combinatorics. We start 
with an integer d = 7 and four partitions of d, namely P; = {1,1,2,3}, 
P, = {4,1,1,1}, P3 = {2,1,1,1,1,1}, and Py = {6,1}. Associated to these 
data we consider the set of isomorphism classes of connected coverings of the 
sphere ramified over four ordered points, of degree d and with ramification 
data given by the four above partitions in this order. The (not a priori con- 
nected) topological configuration space for such coverings is called a Hurwitz 
space. Its construction and finer ones are given in [16, 11, 10, 9, 23]. A nice 
introduction to these questions is (24, Chapter 10]. 

Our goal in this section is to obtain topological information on the Hurwitz 
space associated to the data above through a simple combinatorial study. 

We say that two permutation vectors on d letters (€,, 2, ¢3, C4) and (1%, v2, 
V3,¥4) are conjugate if there is a u € Sq such that v; = “¢; for 7 € {1,2,3,4}. 

We call P; the conjugacy class in Sy associated to the partition P;. We first 
collect all vectors of permutations on d letters (1, C2, (3,4) up to conjugacy, 
such that the following conditions hold 


1. ¢; € P; for i € {1,2,3,4} 


2. CasGo¢1 = 1 


3. the ¢; generate a transitive subgroup G of Sy. 


Being transitive of prime degree, G is primitive. Since it contains a trans- 
Position it must be the full symmetric group on 7 letters [25, Theorem 13.3]. 

There exist various formulae for counting vectors of permutations ([3, 4. 
17, 13] among many others). Unfortunately we also need to exhibit all these 
‘eee and the known methods do not provide an elegant feature for this 

ask, 

In general we may just do an exhaustive search with a computer. For the 
€xample under study, however, we can find all solutions by hand. A useful 
though trivial tool is the following “thickening” lemma. We first give a few 
natural definitions. 
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Definition 1 Let d be a positive integer and S a permutation in Sa. Let I be 
anon empty subset of {1,2,3,...,d}. We define a permutation Siz in the fol. 
lowing manner. For any x € st we call Iv the smallest positive integer k Such 
that S*(x) is in I and we set Sir(x) = S*(x). We call [x, S(x),...,S*> ‘(2)] 
the tail of x. If x is not inT we set S\z(x) = 2. We call Siz the ae 

ofS ted; f 


Lemma 1 Let ¢ and S be two permutations in Sz and let T = Supp(C) be | 
the support of ¢ and Siz the restriction of S to it. Then the restriction (CS)i7 
of the product CS is equal to the product ¢S)z. This means that the ech 
(5 is obtained from the product CSiz by replacing every x not fixed by ¢ by 
its tail. Similarly (S¢)iz = Siz¢. 


This is useful when multiplying a fixed permutation by a permutation 
whose cycle lengths depend on a few parameters. For example 


Corollary 1 Let m, n, p be three positive integers. Then the product [1,m + 
n+1,m+1]*[1,2,..,.m+n][m+n4+1,..,m+n-4 p] is [1,2,...,m][m 4 
1,..,~mtnymtntl,..,m+n4 pl]. 


Note that the restriction of a product is not the product of restrictions. 
However lemma 1 is of some theoretical interest. Indeed, consider the set A 
of infinite sequences of permutations (0,)nenn>o such that o, € S, and the 
restriction of o,4, to {1,2,...,n} is o,. This is the inverse limit of the sets 
S,, with respect to the restriction maps. This A is not a group by the above 
remark. However, lemma | implies that the group S,. of permutations of the 
positive integers with bounded support acts on A by action on coordinates. 
Indeed for any permutation 7 of degree m and any o = (o,), in A we define 
T.o as follows. For any n > m set i, = To, and for any n < m take p, to be 
the restriction of uw», to {1,2,...,n}. Then 7.0 = (fn)n is in A. One can also 
define a right action. These actions clearly have no fixed points. 

Elements of A admit a more geometric description. Consider pairs of 
the form (U,c) where Y is a finite or enumerable disjoint union of oriented 
circles and ¢ is an injection of the set of positive integers into U¢. Two such 
pairs (U4;,¢;) and (U2, 22) are said to be equivalent if and only if there is an 
orientation preserving homeomorphism h from U4, to U2 such that 12 = hoy. 
We call such an equivalence class a propermutation. The left and right actions 
of S.. on propermutations can be seen as cutting and glueing circles. 

As elementary as they are, these considerations allow mental computation 
with permutations. 

We proceed as in [7] and find that there are exactly 48 vectors satisfying 
conditions 1, 2, 3 above (up to conjugacy). We give these vectors in the 
following definition in which a residue class modulo a positive integer N is 
identified with its smallest positive element. 
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Definition 2 For any k mod7 a residue class modulo 7 we denote by a; the 
yector (C15 C2; G35 Ca) with 


C1 = [1], [2], (3, 4], [5, 6, 7], 

C2 = [1, 2,3, 5], 

¢; = [kmod7,k+1 mod 7], 
C4 = (€3¢201)7*. 


We say that the 7 such vectors form the A family. 
For any kmod7 a residue class modulo 7 we denote by by the vector 


(C1, 62,63» C4) with 


4: = [1], [2].[3,4,5], [6,7]. 

@ = [1,2,3,6], 

¢3 = [kmod7,k+1 mod 7], 
Cg = (€3€201)7?. 


We say that the 7 such vectors form the B family. 
For any k mod7 a residue class modulo 7 we denote by cy, the vector 


(1, C2, (3; (4) with 


¢: = [1],[2,3,4], [5], [6,7], 

@ = (1, 2,5,6], 

¢3 = [kmod7,k +1 mod 7], 
Go = (€3@¢)7'. 


We say that the 7 such vectors form the C family. 
For any k mod 3 a residue class modulo 3 we denote by d, the vector 


(01,¢2,¢3, Ca) with 


C1 = (1], (2], [3, 4], [5,6, 7], 
C2 = [1,2,3,4], 

¢3 = [k mod3,5], 

te <=. (Gc). 


We say that the 3 such vectors form the D family. 
For any k mod3 a residue class modulo 3 we denote by e, the vector 


(G1, (2, (3; C4) with 


4. = [1],{2],[3,4,5], [6,7], 
G@ = ([1,2,3,5], 

¢3 = [k mod 3,6], 

= (GGd)"": 


We say that the 3 such vectors form the E family. 
For any k mod 5 a residue class modulo 5 we denote by f, the vector 


(C1, (2, (3, Ca) with 


a = (1), [2,3], [4,5, 6], [7], 
C2 = ([1,2,4,6], 
¢3 = [k mod 5,7], 


C4 = (€3€20)7?. 


We say that the 5 such vectors form the F family. 
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For any k mod 5 a residue class modulo 5 we denote by gx the vecto, 
(Cis 2.03. C4) with 


C= (1,21 03114,5,6), 7 
€2 = (1,3, 4,6], 
C3 = [kmod5,7], 
C4 = (€a¢2¢,)7?. 


We say that the 5 such vectors form the G family. 
For any k mod 5 a residue class modulo 5 we denote by hy the vectop 


LCas G2; (3, C4) with 


C1 = (1], {2,3, 4], [5,6], [7], 
6 = [1, 2,5, 6}, 

¢3 = [k mod 5,7], 

Co = (¢3G2:)7’. 


We say that the 5 such vectors form the H family. 
For any k mod 5 a residue class modulo 5 we denote by 1, the vector 


(C1, 62,63, 64) wath 


¢: = [1,2,3], [4], [5,6], [7], 
G = [1,4,5,6], 

¢3 = [kmod5,7], 

Cs = (¢3¢2.)7?- 


We say that the 5 such vectors form the I family. 


We now compute the action of braids on these 48 vectors. To this end 
we consider the configuration space Xo, = P,‘ — A of spheres minus four 
pairwise distinct points. A point Q = (Q1, M2, Q3, M4) in Xo,4 corresponds to 
the sphere P;—(Q 1, Q2, Q3, Q4). If Z is a coordinate on P,; we denote by Z(Q) 
the vector (Z7(Q1), Z(Q2), Z(Q3), Z(Q4)) and we call it the Z-coordinate of 
(). We pick such a coordinate Z and choose as a base point for Xo,4 the point 
P=(P,,P2, Ps, Ps) with Z-coordinate Z(P) = (0,1,2,00). We also choose a 
base point 6 on the corresponding sphere P, — (P,, P2, P3, Py). We take for b 
the whole upper half plane in the Z-coordinate. This makes sense because the 
upper half plane is a contractible set. We also pick generators (1,,T2, 3,14) 
for (Py — {Py, Po, P3, Pa}, 6) as on figure 4. 

The fundamental group 7;(Xo,4, P) is generated by braids t1,2, to3, t3,4 
defined in the classical way. For example t,,. is represented by the map 
t1,2(u) with Z-coordinates Z(t,,2(u)) = (1/2 — 1/2e""", 1/2 + 1/2e7***, 2, 00) 
for u € [0,1]. The action on monodromy vectors in these basis is then given by 
t1,2((C1, C2, G3, Ca)) = (62510, $%'Ca, C3, C4). See [16]. Straightforward calculation 
then gives the following fact. 


Fact 1 For any k a residue modulo 7 


ti2(ae) = @e-1 
ty2(bk) = be-1 
ti2(ck) = Ck-1- 
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Figure 4: 7,(Pi — {Pi, Po, Ps, Ps}, 6) in the Z-coordinate 
For any k a residue modulo 3 
ti2(dk) = dg. 


For any k a residue modulo 4 


ti 2(ex) = Cheais 


For any k a residue modulo 5 


ti2(fe) = fro 
ti2(9k) = 9k-1 
ti2(Ax) = hea 
tio(te) = te-y. 


The action of tz.3 on our 48 vectors is given below (trivial cycles are omit- 


ted) 


[B1,B2,Cs][Cy,A1,A2][C7,f1 ,D2,Hs,Ce][As,As,D1JaIs} 


[A7 Fi Fo ,£3,As][B7,A1,H2,D3,Be][Bs,Bs,£1,G3,G4][C2,C4,Gi ,E2,F4]. 


The above fact gives us the combinatorial description of the configuration 
space M parametrizing our family. Since the coverings we consider have no 
automorphisms we even have a covering of universal curves: 


Mit 


® 


Xo,5 


A 


Xo,4 


Following [8] we embed the moduli space of spheres minus four points 
Mo4 = P,— {Ry, Ro, R3} in the configuration space Xo,4 as the subvariety of 
Points (R), Ro,Q, Rs) for Q € P; — {R,, Rz, R3}. The restriction of A to the 
Curve Mo 4 is a covering of curves also called A: # > P, —{Ri, Ro, Rs}. The 
Curve H is the moduli space of our family of coverings and is often called a 
Hurwitz space. This Hurwitz space is mapped by A onto the moduli space 
Mo,4 of spheres minus four points. 
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From fact 1 and the Hurwitz formula we deduce that the curve H has 
genus zero. Recall that z is the coordinate such that 2(A,) = 0, z(R2) = i 
and z(f3) = oo. The preimage by A of the segment consisting of points With 
=-coordinates in [1, oc] is a connected graph on the sphere which we represen; 
below. The bullets correspond to points above 00 and the other vertices to 
points over 1. The points above 0 are associated with faces. To any point 
above oo corresponds one of the nine families A, B,C, D,E,F,G,H, and J. 


5a 
; ~ 7 


Figure 5: The Hurwitz space H 


4 Patching 


In this section we compute an analytic model for the family of coverings 
presented in section 3. 

Let again P,(C) be the projective line over the field of complex numbers 
and let U,, U2, U3. Uy be four distinct points on it. We call Z the unique 
coordinate such that Z(Ui) = 0, Z(U2) = 1, Z(Us4) = oo. We also set 
A = Z(U3) and y = 1/X. For convenience we introduce another coordinate 
W = Z/X such that W(U,) = 0, W(U2) = x, W(U3) = 1, and W(U4) = oo. 

Assume first that A is a real greater than 1 and let us choose as a base 
point 6 the whole upper half plane in the Z-coordinates (which is also the 
upper half plane in the W-coordinate since Z = AW and 4 is real positive). 
We also pick generators (0), Q2, Q3, Q4) for 7,(Py(C) — {U1, U2, U3, Uy}, b) as 
represented on figure 6. Note that these data depend continuously on \. For 

= 2 we find ourselves in the situation of figure 4. 
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©4 


Figure 6: 7(Pi(C) — {U1, U2, U3, Uy}, 6) in the Z-coordinate 


We define pu = K3x1/" where A is a constant that will be chosen most 
conveniently latter and y1/" is the unique real seventh root of y. The reason 
for introducing this yu is that we are going to study the Hurwitz space H 
locally at the point C’ represented on figure 5. This point corresponds to 
the C family and it is mapped onto R3 € P, by the Hurwitz map A. The 
ramification index of A at Cis 7. Therefore py is a local parameter at C on the 
Hurwitz space and we expect all coordinates arising in the algebraic model 
we are looking for to be Laurent series in p. 

For any  €]0,1[ we call 6, : P; > P, the covering with monodromy 


Ct = (G1, G2, C3, Ca) 


C1 = [1], [2,3, 4], (5), (6, 7], 
62 pa [1, 2,5, 6], 

¢3 = [1,5], 

Go = (3¢201)7?. 


in the basis of 7,(P, — {U,, U2,U3,U4},6) given in figure 6. Again, one can 
show that this covering is defined over the field of real numbers (see [(5, 6]). 
We call V, the unique point above U; with multiplicity 2 and V2 the unique 
point above U; with multiplicity 3. We call V3 the unique point above U2 
with multiplicity 4 and V4 the unique point above U3 with multiplicity 2 and 
Vs the unique point above U, with multiplicity 1 and Ve the unique point 
above U, with multiplicity 6. 

We now try to understand what happens when ju tends to zero. 

If we look at things from the point of view of W-coordinates we see that 
W(U2) = y tends to 0 = W(U,) while W(U3) = 1 and W(U,) = oo. In 
the limit we get a sphere minus three points U, = U2, U3 and U4, and a 
fundamental group ™(Pi(C) — {U; = U2,U3,Us},6) generated by T12 = 
PT, P3,P4 as on figure 7. Indeed, when U, and U2 coalesce, turning around 
the resulting point is just turning around U, then Uz. Topologically, this is 
Equivalent to punching a big grey hole containing U; and U2 or equivalently 
removing the segment [U, U2]. Turning around this big hole is equivalent to 
turning around U; and then around U4. 
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Figure 7: 7(P; — {U; = U2, U3, Us)},}) 


We conclude that when y tends to zero the covering ©, tends to a covering 
of the sphere minus three points with monodromy ((2¢).¢3, ¢4) in the basis 
(Ui 2,03, 04) of m1 (P) — {U; = U2, U3, Us}, 6) shown on figure 7. Coming back 
to section 2 we see that ¢2¢; = p, and ¢3 = p2 and (4 = p3. Therefore 4, 
tends to f when yu tends to zero. 

In order to take advantage of this we shall write down a model for ¢,. As 
usual, we must chose coordinates on each side. As for the right-hand side we 
shall of course consider W-coordinates. We note that when y tends to zero 
W tends to the coordinate z of section 2 since it takes values 0, 1 and oo at 
the three ramification points of the limit covering. 

On the left-hand side we must pin three points to 0, 1 and oo. We must 
be careful to choosing three points that do remain distinct when p tends to 
zero. For example, points above U, and U2 may well coalesce since U, and 
U, coalesce. On the other hand, a point above U; and a point above U3 will 
not coalesce. Two points above U3 will not coalesce either. This corresponds 
mutatis mutandis to the notion of admissible families of points introduced in 
Definition 3 of [7]. The key mathematical idea underneath is to be found in 
sections 2 and 3 of [11]. 

In our situation we see that {¥3, ¥s, Ve} form an admissible family because 
Vs and Vg map to U4 which does not coalesce to any other point. Indeed when 
pe tends to zero then V3 tends to the point S, of section 2 because V3 is above 
Uy and U2 tends to R, and S; is the unique point above R,. Similarly Vs 
tends to S_ because V; is above U4 and U4 tends to R3 and the multiplicity 
of V5 is equal to 1 and is not affected as p tends to zero because Uy is simple 
(i.e. does not coalesce) in the W-coordinate, see [11]. Also Ve tends to S3. 
We call X the coordinate which takes values 0, 1 and oo at V3, V5 and Ve 
respectively. Then X tends to the coordinate z of section 2. 

The covering ©, is now represented by a rational fraction ®,(X) = W 
such that 


®,(X) = a 


_—~ 
—~J 
— 
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We shall denote by v, the valuation associated to yx. We know that 
ay) — 7 and from the above 


Vulra) + 2uy,(X(Vi)) + 8y(X(V2)) = 7. (11) 


On the other hand since V3 and all the points above U; coalesce we know 
that v,(T2)s v,(X(Vi)) and v,(.X(V2)) are positive integers. We deduce that 


v, (12) = 2and v,(X(Yi)) = v(X(V2)) = 1. We write 


X (V2) = w(v2,0 + varie + v2,2H? +...) 


In order to complete the picture we now look at the situation from the 
point of view of the coordinate Z. We see that Z(U3) = A tends to co = Z(U4) 
when p tends to 0 while Z(U,) = 0 and Z(U2) = 1. 

At the end we get a sphere minus three points U,, Uz and U3; = U, 
and a fundamental group 7(P\(C) — {U,,U2,U3 = U,4},b) generated by 
L,, 02,34 = [304 as on figure 8. Indeed, when U3 and U, coalesce, turning 
around the resulting point is-just turning around U3 then U4. Topologi- 
cally, this is equivalent to punching a big grey hole containing U3 and U4 or 
equivalently removing the segment [U3, U4]. Turning around this big hole is 
equivalent to turning around U3 and then around U4. 


Figure 8: 7(P, — {U,, U2, U3 = Us)}, b) 


We conclude that when p tends to zero the covering @,, tends toa covering 
of the sphere minus three points with monodromy ((),¢2,¢4¢3) in the basis 
(Ti, T,T3,4) of m™(P, — {U,, Uz, U3 = Uy}, 6) shown on figure 8. Coming back 
to section 2 we see that G = 7 and ¢ = 72 and CyC3 = 73. Therefore ¢, 
tends to g when pz tends to zero. 

In order to take advantage of this we shall write down a model for Pu 
with adapted coordinates on each side. As for the right-hand side we shall of 
Course consider Z-coordinates. We note that when p tends to zero Z tends 
to the coordinate z of section 2 since it takes values 0, 1 and o¢ at the three 
Tamification points of the limit covering. 

On the left-hand side we must pin down again three points that do not 
Pairwise coalesce, for example {V3, V2, Vg}. We pick a coordinate Y that takes 
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values 0, 1 and oo at V3, V2 and Vg respectively. This implies that ALY 4 
constant equal to X(V2)/¥(V2). We set « = a = v20+O() and we ad 


X = pry * (12) 


Note also that Y tends to the coordinate y of section 2. 

The covering ©, can be now represented by a rational fraction U wW(Y)=2 
which is related to 6,(X) = W by Z = WA and X = wrY. We replace xX 
by #«Y in equations 7, 8 and divide out by y and find 


| 
Kp Kan(Y? — SY + 2,)(y — SE yary — 1)8 
Y)= ae 0 oD A mY 0 i 
and 


AgKanTY(¥? — “y? + pty = <a) 
si al Ora ea (14) 


As u tends to zero W, tends to the rational fraction G given in section 2. 
Comparing the leading coefficients in 13 and 3, and using 6 we find 


6° 60 
7 2 
As 77 2,0 = fat 
We now take advantage of the freedom in chosing A3 assuming A‘ is the 
unique real root of 


so that 


Since our covering is real we deduce that V2.9 = 1 so 


X(Vs) = (1 + O(y)). 
Comparing 13, 14 and 5 we find 
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_ hyX*(X? = ay xX" + $9X = $3) 


®,(X) — x a (3) 
and 
ee 2 a ee ea) 
i X-1 
(9) 


Where Aa, "1, 72; $1, $2, $3, U1,---,4s depend on p. 

As p tends to zero the covering ¢, tends to f while the coordinates X 
and W tend to z and y. Therefore the rational fraction ©, tends to F. From 
the comparison of formulae 1 and 2 on the one hand and 7, 8, 9 on the other 
hand we deduce the following 


rr = O(n) 
rz = O(n) 
X(Vi) = O(n) 
X(V2) = O(p) 
8s, = O(n) 
s2 = O(n) 
s3 = O(n) 
7 
X(V4) = g + Olu) 
7 
uw = —3 t O(n) 
49 
We = a5 + Olu) 
343 
US Soe O(n) 
12005 
Ug = [296 O(u) 
16807 
BS =) — 99g HMR 
‘ 6° 
Ky = aq + O(n) 


Further if we set X = 0 in equations 7 and 8 we find 


x = Kar2X(Vi)?X(W). (10) 
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5 
%s ee eore 
od 


19 
n= He ta5t +3 7? + O(u)) 
X(Vi) = HO )) 


X(Va) = p(1 + O(u)) 


This time we have first order approximations for all the coefficients arising 
in our algebraic model. Computing higher order approximations now reduces 
to linear algebra by Hensel’s Lemma. We just plug formal developements into 
equations 7, 8, 9 and develop in the variable . We get a non singular linear 
system of equations in the next order terms etc. 

For example we find 


X(Vo) = p— (8/21 45f/42)y? + 

+(101/2205 + 529f/4410 — 5f?/252)u3 +.. 
fy — (5f/21 + 11f? /42)u? + 

+(29/4410 + 229f/17640 + 811f7/4410)u? + 


I! 


X(V1) 


5 Looking for algebraic dependencies 


In this section we shall derive an algebraic model from the analytic one ob- 
tained in section 4. 


5.1 General procedure 


In section 4 we obtained an analytic model for the covering ¢, with coeff- 
cients in the complete field Q(f)((u)). We know that this model is actually 
defined over the algebraic closure of Q(X) in Q(f)(()). We shall now look 
for algebraic dependencies between the various coefficients arising in the ex- 
pression for ¢,. This will give us a model for the Hurwitz space H. defined in 
section 3. 

We pick two functions (X(V,) and X(V2) for example) and look for alge- 
braic dependencies between them. Let us call C(#) the field of functions on 
the curve H (our Hurwitz space). This is a genus zero function field over C. 
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To ease notations we shall set vy} = X(V,) and v2 = X(V2). Assume the degree 
of the extension C(#1)/C(v,) is dj and the degree of C(H)/C(v2) is dz. Then 
there exists a polynomial E(.X,, X2) in two variables, with coefficients in C 
and degree in X (resp. in Y) equal to dz (resp. d,) such that E(v,, v2) = 0. 
If we know the expansions of v, and v2 with enough accuracy (i.e. more than 
(d+ 1)(d2 + 1)) finding such a relation is just a matter of linear algebra. We 
try successive increasing values for d; and dj. For dj = d, = 3 we find 


E(v1,v2) = —36v2v; + 24v, + 60vy + 54vyv1 + T2rj v2 — 18vy01 — 36v;07 
=e (15) 


We shall see in section 6 that the degrees dy and dz can be computed a 
priori from the monodromy ¢. We therefore need not compute infinitly many 
terms in our expansions. With bounded accuracy we can obtain enough de- 
pendencies to determine all the coefficients in our equation. This will provide 
us with a proof that the equation holds since we a priori know that such 
an equation does exist. Considerations in section 6 will enable us to choose 
functions like v; and v2 that make the degrees d; and dz minimal. 

The curve given by equation 15 is expected to be of genus 0. We therefore 
look for a parametrization using an algorithm due to Noether, Poincaré and 
Vessiot— see [18] for a complete algorithmic survey on this question, includ- 
ing problems of rationality. In our case, of course, finding a parameter is 
particularly trivial since 15 has a unique triple point. 

We find that T = v/v; is a parameter and 


a ee a a 
IS BT Lor 4 P) 
Let do be the degree of the field extension C(H)/C(v,, v2). We may rea- 
sonably expect this dg to be small (the irreducibility of equation 15 implies 
dy = 1). Let d3 be the degree of C(H)/C(r,). We now look for algebraic 
dependencies between T and r; with degree dp in r; and d3 in T. For dp = 1 
and d3 = 6 we find 


and v2 = Tv}. 


2(T + 4)(4T° + 9T? + 12T + 10)(T +1)? 
n= 
: 3(T4 + 6T? + 21T? + 16T +6) 
Similarly we find 


(47° + 9T? + 12T 4+ 10)? 
3(T? + 2T + 2)(T4 + 673 + 217? + 16T +6) 
and this is enough for our purpose since C(H) = C(v1, v2) = C(T). 
Indeed we set yr(X) = (X? —rl1.X + r2)(X — v1)?(X — ve)8/(X — 1) and 


factor its derivative with respect to X. This derivative has roots vi, v2 with 


ro = 
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multiplicity 2, 0 with multiplicity 3 and v4 = X(V4). We deduce an expression 
for v4: 


= 27? + 67° + 21T* + 567° + 51T* + 30T + 10) 
reed 3T(T4 + 67? + 217? + 16T +6) 

We have Ky = 1/yr(va), x = yr(0)/yr(va), and O7(X) = yr(X)/yr(v4), 
Note that we now write ®; rather than ®, since T is the right algebraic 
parameter. 

The singular value x is given as a rational fraction in T which is unramified 
outside {0,1,00} i.e. a Belyi function. The associated dessin is the one 
on figure 5 and the monodromy is the one described in section 3. Note in 
particular that the factorisation of y and y — 1 fits with fact 1. Indeed 
v= 2 and x-1= 1 with 


xo = —(T44 6T? + 21T? + 16T + 6)°T4(4T? + 9T? + 12T + 10)’ 


and 
xoo =(T—-1)9(T4 487° 4367? 440T+420)4(T 41)? (27? 4.742)? (274487? $157? $87 $2)? 
(T4372 467410)(2T° 412T° 45174 494T° 41117? 4607420) 
and 
x1 =—16(—80-288T—48T? 42432T° +7896T? +13776T° 415656T° 4124327" 46711 T® 4222279 
44777 9 46071 44712) (24274773 (78 4675 42174 45673 45172 +30T+$10)°. 
and 


T°(T* + 67? + 21T? + 167 + 6)°(T? + 2T + 2)® 
Xoo 
We can now replace J by values in Q and find coverings defined over Q 
in our family. The key point here is that our family is defined over Q. More 
precisely the Hurwitz space is irreducible and defined over Q and further has 
many rational points. To check our computations, we just make sure that 


formulae 7, 8 and 9 hold. 


Ky =—729 


5.2 Using numerical approximations 


In the previous paragraph we computed an algebraic model for our family of 
coverings from an analytic one using linear algebra computations over the field 
Q(f). Indeed we were dealing with series in Q( f)(()) with bounded accuracy. 
The computations, however, will be greatly accelerated if we work in C((2)) 
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-ctead of Q(f)((4)) approximating complex numbers to some fixed accuracy. 
yns ; : 

This accuracy depends now on the height of the equations we expect to find. 
We have no nice upper bound for this height. We just try. We compute an 
estimate for f and write down expansions for v; and v2 with approximate 
coefficients in C. For the linear algebra part (looking for dependencies) we 
no longer use Gauss algorithm but least squares. This gives a vector V that 
minimizes the L2-norm of MV—6 for a given matrix M and vector b. Of course 
jeast squares always give a solution and we need a criterion for this solution 
to be relevant, depending on the accuracy. We may give a quantified criterion 
put there is a very simple and efficient qualitative one: something interesting 
is happening if and only if a small perturbation in the data (i.e. close to zero 
according to current accuracy) induces a big gap in the minimal norm (i.e. 
change of magnitude). We thus obtain approximations for the coefficients in 
the linear equation we are looking for. Since these coefficients are expected to 
belong to Q(f) we then look for integer linear relations between 1, 6, 6% and 
any such coefficient c. This is achieved using the famous LLL algorithm [19]. 
Indeed such a relation corresponds to a small vector in the orthogonal lattice 
to the vector (N,|N@],|N6?],|Nc]) where N is a large integer (close to the 
inverse of the accuracy). And LLL is designed for finding such small vectors. 
Another possibility is to perform all the computations using successively all 
the conjugates of f in C and then form the symmetric functions of the results. 
We then obtain approximations of rational numbers and may find their exact 
values thanks to continued fraction algorithm. This is the strategy adopted 
in [1]. 

There is an important variant to the method described above. Once we 
have computed ©, as a rational fraction with coefficients in C(()) with a 
small accuracy (in the p-adic topology) we may stop there the computation of 
expansions and replace js by a small complex number jg. This will give us an 
approximation according to the ordinary absolute value in C of the rational 
fraction ®,,, branched at {0,1,00,\9}. We then plug this approximation into 
equations 7, 8 and 9 (where \ is fixed to the value \g) and apply an itera- 
tive numerical method like Newton’s method to get an arbitrarily accurate 
approximation. We may then replace Ag by a very close value 4; in equations 
Tand 8. The rational fraction ®,,, is then a close approximation of a rational 
fraction ©, branched at {0, 1,00, Ai} and the latter one is found by applying 
Newton’s method to equations 7 and 8 with \ fixed to 4; and initial value 
®,, for ®,,. We can move slowly this way in our Hurwitz space and list 
vectors of values of the various functions rj, r2, v;,... at many points in it. 
If Q,, Qo, ..., Q, are these points we represent any such function by an ele- 
Ment in the algebra C* and we can look for algebraic relations as before. We 
have replaced Taylor expansion at one point by interpolation at many points. 

€ may of course mix the two approaches. We have to be careful that our 
Points Q; should be well distributed on the Hurwitz space. Otherwise we are 
80ing to loose much accuracy since too points that are close to each other 
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lead to almost the same equations. We use least squares as before with the 
same criterion for testing the relevance of the result. This approach was used 
succesfully in [14]. 


6 Stable curves 


In this section we shall give elements for the generalization of the method 
presented in section 4. We shall also explain how one can compute the degrees 
of coefficients in some algebraic model for a family of coverings by mere 
consideration of its monodromy. This will be useful when looking for algebraic 
depencies since it will tell us what is the degree of the relations we are looking 
for. In particular we shall be able to select functions that satisfy algebraic 
relations with smallest possible degree. 

The key point in section 4 was that the degeneracy of a sphere minus four 
points could be seen from two different points of views (namely according to Z 
or W coordinates). There is a standard way to reconciliate these two points of 
views. It is connected with the compactification of the moduli space of genus 
zero r-pointed curves. This compactification is very explicitely described in 
[12] in terms of trees of projective lines. In this section we shall assume that 
the reader has some familiarity with this work. We just recall that the authors 
construct a fine moduli space for r-pointed curves from the consideration of all 
possible cross-ratios between four marked points on the sphere. One defines 
the cross-ratio [(U;, V2, U3, U4] as (zs — 21)(z4 — z2)/(z3 — 22) /(z4 — z) where 
21, 22, 23, 24 are the value of any coordinate z at U,, U2, U3, U4. Note that 
[0,00,2,1] = z. The relations between these cross-ratios are of two types. 
Those coming from the action of the symmetric group Sq on the cross-ratio 
[U,, U2, U3,U4] by permutation of the indices plus a relation involving five 
points 


(U2, Us, Us, Us.[Us, Ue, Ua, Ua} = (Ui, Us, Us, Ua). 


This last relation is connected with the algebraic group law on P, — 
{U3,U4}. If we consider the projective variant of these equations (introduc- 
ing numerators and denominators for all cross-ratios) we obtain a projective 
variety Mo, which contains the moduli space Mo, as an open subset. 

There also exists a universal curve Mor+1 —> Mor corresponding to “for- 
getting the last point”. 

The degenerate curves can be described as follows. Assume that we have 
a projective line minus four points U;, Uz, U3, U, and let U, and U2 coalesce. 
The object we get at the end is obtained in the following way : replace U; 
and U2 by another line crossing the first one and put U, and U2 on it. 

Now let A = C{[y]] be the local ring of Laurent series in the parameter 
wand @ = C((z)) its quotient ring. Let ¢, : C, + Py — {U1,U2,U3, U4} 
be a covering defined over Q (i.e. a family of coverings parametrized by the 
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Figure 9: A 4-pointed tree of two projective lines 


local parameter p). We assume that the cross-ratio [U,, U4, U2, U3] is equal 
to p*° for some positive integer e. Let W be the coordinate that takes values 
0, °, 1, oo at U,, Uz, Us, Us respectively. Let Z = W/p* be the coordinate 
that takes values 0, 1, u~°, co at U;, U2, U3, Us respectively. The associated 
field extension to ¢, is Q(W) C Q(C,,). By restricting ys to values in [0,1] 
we may define the monodromy ¢ = (¢1,¢2,¢3,¢4) of , as in section 4. We 
assume (C2¢1)° = 1. We can always reduce to this case after base change (i.e. 
replacing pz by us for some integer o). We now consider the ring Ro generated 
by W and T = 1/Z over A. This ring Ro is equal to Cl[y}][W, 7] /(WT — p°). 
Let also Ry = A[W] and Ry = A[T]. Then Spec(Ri) and Spec(R2) glue 
together along Spec(7o) to form a fibered surface S over Spec(A) the generic 
fiber of which is a smooth curve of genus zero while the special fiber S, is made 
of two genus zero curves VY and W crossing at the point O with coordinates 
W =T =p = 0. The local ring at O is Cl[p, W,T]|/(WT — u*). The Zarisky 
closures of the points U;, U2, U3 and U, define horizontal divisors on S. We 
represent the situation in figure 10. 


Uy 


U2 


Figure 10: The fibered surface S 


We may now take the normal closure of S in the field extension Q(W) C 
Q(C,,) and obtain a fibered surface T over Spec(A) and a map y: 7 > § 
Whose generic fiber is just 6, The point is that JT can be described quite 
Sharply in terms of the monodromy ¢. First of all, the special fiber JT, isa 
Connected curve made of several irreducible components meeting each other 
transversally and with no other singularity than these crossing points between 
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distinct irreducible components. 

The irreducible components of 7 fall in two parts. Those that are mappeq 
onto V by y which we call V; for 1 <2 < J and those that are mapped onto W 
which we call W; for 1 < 7 < J. The components V; correspond to the orbites 
O; of (C1, ¢2,G4¢3) and the restriction of y to V; is a covering ramified ove, 
three points with monodromy (1, ¢2, ¢4¢3)|o, where the | means restriction 
to O;. Similarly the components W; correspond to orbites Q; of (€2¢1, 3, C4), 
Two components V; and W, intersect if they share a cycle of ¢2¢1 = (¢4¢3)7! 
and these cycles are in bijection with the crossing points O; forl1 <k< 
on 7;. These points are the points above O. 

If the crossing point O, of V, and W, is associated to a cycle of length 
£, in C2¢) then the local ring at O; is isomorphic to C[[y, w, t]]/(wt — pt) 
where ¢ and w are local parameters at O, on V; and W, respectively. The 
ratio 0, = Z, 1s called the thickness of the intersection point O;. Note that 6, 
is an integer because we assumed ((2¢,)° = 1. Note also that 6, depends on 


the local parameter p in the sense that if we replace yu by [Le for some integer 
o (base change) the thickness 6; is multiplied by o. 

The points mapped to U; and U2 by @, correspond to cycles of ¢, and ¢ 
and their Zarisky closures in 7 cross the Y;’s while the points mapped to U3 
and U's, by ¢, correspond to cycles of ¢3 and ¢4 and their Zarisky closures in 
T cross the W,’s. 

In case the generic genus is zero, the special fiber 7, is a r-pointed tree of 
projective lines as in [12] and 7 is a deformation of it over S'pec( A). 

We show two examples of such a situation. These examples will be two 
degeneracies of the covering @7 studied before. 

Assume first that ¢ is the monodromy cs studied in section 4, correspond- 
ing to point C on figure 5. Both (¢1, 2, ¢4¢3) and (C201, ¢3, ¢4) have a single 
orbite. Therefore we have a single component VY; above VY and a single com- 
ponent W, above W. Since ¢2¢; is a full 7-cycle we have £; = 7. On the other 
hand, the order e of the braid t;,2 acting on ¢ is also 7. Thus the thickness of 
the unique point O; above O is just 6; = 1. 

We draw the corresponding situation on figure 11. 

Assume now that ¢ is the monodromy d, given in definition 2. 


qa = [1], [2], (3, 4], [5,6, 7], 
C2 = {1, 2,3, 4], 

¢3 = 1,5], 

Cr Se (GG): 


This monodromy corresponds to point D in figure 5. Now (1, C2, C4¢3) 
has two orbites O,; = {1,2,3,4} and O2 = {5,6,7} while (G21, ¢3, C4) has two 
orbites Q; = {1,2,3,5,6,7} and Q. = {4}. 

We have three points O,, O2 and O3 above O corresponding to the three 
cycles (1,2,3), (5,6,7) and (4) of G24). 
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Vi 


Va V2 


Wi Thickness @, = 1 


Figure 11: The special fiber at point C € H 


The order e of braid action is 3 thus the thicknesses are 6; = 1, 62 = 1, 
03 = ; 


We draw the corresponding picture on figure 12. 


Wi 


Figure 12: The special fiber at point D € H 


Thicknesses are very useful to compute the order of vanishing (s:-adic 
valuation) of cross-ratios. For example, in figure 12 the thickness 6; of O, is 
Nothing but the valuation of the cross-ratio [Va, Vi, Ve, Val. 

More generally, recall that associated to each component K of a stable 
tree of projective lines there is a projection Px of the full curve onto this 
component. There is also a unique median component L; associated to any 
triple of non-singular points 6 = (61,42,43). This component is the unique 
One on which the three points have pairwise distinct projections. 

If we have four points Vi, V2, V3, Vs crossing the special fiber at non- 
Singular distinct points, then there are only two possibilities. 

Either the intersections of these points with the special fiber have dis- 
tinct projections on some component of it. In this case the cross-ratio p = 
[V., Va, V2, Va] is a unit in A = C[[y]] and so is p—1. This is the case for Vi, 
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V3, V4, Vs on figure 12 since their instersections with the special fiber have 
distinct projections on Yj. 

If the intersections of Vj, V2, V3, Vs don’t project on distinct points op 
any component, then there are two components K, and Kz with assoc}. 
ated projections Px, and Px, such that for example Pr,(Vi), Pe, (V2), Pr, (Va) 
are pairwise distinct while Px,(V3) = Px,(Vs) and Pr, (Vs), Pr, (V4), Px, (Vj) 
are pairwise distinct while Px,(Vi) = Px,(V2). In this case, the cross-ratig 
(Vi, V3, V2, V4] has p-adic valuation equal to the distance between K, and Le 
which is defined as the sum of the thicknesses of all intersections between K, 
and Ky. 

In particular, we see that the multiplicity of this cross-ratio is bounded 
by e times the number of intersections. And there are no more intersections 
than the degree d, of the covering ¢,. 

We deduce that if we have a genus zero covering ¢ of degree dy of P, 
ramified over four points U,, U2, Us, Uy and if we pick V1, V2, V3, V4 above U,, 
U2, U3, Us respectively, then the cross-ratio p = [V,, V3, V2, Va] only vanishes 
when » = [U,,U3,U2,U4] vanishes and the multiplicity of p is at most d, 
times the multiplicity of A. The exact multiplicity can be obtained from the 
geometry of the special fiber as we just did. 

Let us now call d, the degree of the Hurwitz map from the moduli space 
H for a family of coverings to Mo4 = Pi; — {0,1, 00}. We represent below this 
Hurwitz map and the covering of universal curves as a fibration above it. 


as ay 
A ® 
Mos 7 Mos 


The degree of the extension C(Mo4) = C(A) C C(H) is da and from the 
calculation above we deduce that the degree of the extension C(p) C C(H) is 
bounded above by da x dg. 

We thus have a bound for the degree of coefficients appearing in some 
algebraic model for ¢. 

This bound is indeed very pessimistic since it assumes that we always have 
many orbites and thick intersections between them. In practice one rather 
expects degrees like d,a/dy. In our example we have dg = 7 and da = 48 so 
that our estimate gives 48 x 7. If we look at formulae in section 5 we find 
that all quantities are rational fractions of degree 6 = 48/7 of the parameter. 

In all cases, the extensive combinatorial study of all degenerations of a 
genus zero covering gives the divisor of zeros and poles of any cross-ratio 
between branched points. One can then cook a multiplicative combination of 
these cross-ratios with smallest possible degree. This amounts to finding the 
shortest vector in the linear space generated by the divisors of all cross-ratios. 

Incidentally we obtain a criterion for the Hurwitz space H to be rational : 
it is rational if there exists a (combination of) cross-ratio with a single zero. 
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‘, may be checked easily on the monodromy ¢ in some cases. 
The method presented here applies to any genus zero covering of the 
pere minus r points. When r is bigger than 4 one has to consider maxi- 
mally degenerate covers. The local ring at the corresponding points in the 
moduli space Mo, is generated by cross-ratios. The ramification at the mini- 
om! primes corresponding to these cross-ratios is computed in terms of braid 
action and the order of vanishing of cross-ratios of branched points along the 
corresponding divisors can be evaluated as before. This gives a method for 


computing the partial degrees of any coefficient in the algebraic model. 


7 Conclusion 


We have shown how the computation of a family of coverings can efficiently 
reduce to the computation of degenerate coverings of the sphere minus three 
points. The most degenerate will be the better. Reciprocally, if we want 
to compute a covering of the sphere minus three points, we may find it as 
a special fiber in a family of higher dimension. The latter family may well 
be easy to compute provided it admits an(other) simple special fiber. We 
also have given a method for computing the degree of a coefficient c in some 
algebraic model (the degree of C(H)/C(c)) in terms of the monodromy ¢. 
These degrees depend on the geometry of the degenerate coverings in the 
family. 
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Some Arithmetic Properties 
of Algebraic Covers 


PIERRE DEBES 


ABSTRACT. Consider a Galois extension F/A and an algebraic cover f: 
X—B a priori defined over F. The cover f may have several models 
(and possibly none) over each given subfield EF of F. How do these models 
compare to each other? Are there better models than others? We establish 
here a structure result for the set of all various models which can be used 
to investigate these questions. The structure, which is of cohomological 
nature, yields an interesting arithmetical tool: K-covers can be ‘twisted’ to 
provide other K-models with possibly better properties. One application 
is concerned with the Beckmann-Black problem. E. Black conjectures 
that each Galois extension E/K is the specialization of a Galois branched 
cover of P’ defined over A with the same Galois group G. We show the 
conjecture holds when G is abelian and K is an arbitrary field; this was 
known for number fields from results of S. Beckmann (1992) and E. Black 
(1995). Other applications include discussions of existence for a given 
cover, of a “good” model, a stable model, a model with a totally rational 
fiber, etc. Also we clarify an inacurracy in a result of Fried about field 
of moduli and extensions of constants. Finally, we continue our study of 
local-globa) principles for covers: if a cover is defined over each Q,, does 
it follow it is defined over Q? Here we consider the case of Galois covers 
of a general base space B. 


1. Introduction 


This paper is organized as follows. §2 is devoted to the structure result 
mentioned in the abstract for the set of all A-models of a given cover f : 
X -— B a priori defined over a Galois extension of K. There are two 
versions. In the first one (§2.4) we assume that the base space B of the cover 
satisfies a certain condition introduced in [DeDol|] and called the (Seq/ Split) 
condition. That condition, which is recalled in §2.3, holds for example if 
the base space has unramified K-rational points. The general case of the 
structure result is given in §2.5. §2.1-§2.3 review some basics relative to the 
arithmetic of covers. 

§2 is used in §3 to study two questions about the realization of covers 
with some arithmetical constraint on some fiber. The first one (§3.1) is 
known as the Beckmann-Black problem: given a field K, is every finite 
Galois extension E/K the specialization of a Galois branched cover of P! 
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which is defined over JY and has the same Galois group? Beckmann and 
Black answered positively when G is abelian and Ix is a number field. We 
extend this result to arbitrary fields A’. The second question (§3.2) is about 
the existence for a given J-cover of an unramified totally A-rational fiber. 
We comment on this property: in particular we recapitulate what is known 
about it and review several situations where it revealed helpful. 

§4 is about field of moduli (§4.2) and extension of constants (§2.5.1). 
Given a [-cover fr, consider, on one hand, the Galois closure fx of fix 
over A and, on the other hand, the Galois closure f of fx @n NW, over the 
separable closure K, of K. ThA.1 (84.3) relates the extension of constants 
of fx in fx and the field of moduli of f as G-cover. Two consequences are 
given (§4.4). The first one is a criterion for the field of moduli of f to be 
a field of definition (as G-cover). The second one clarifies an inacurracy in 
a result of Fried about extensions of constants in a cover with centerless 
group. A counter-example to Fried’s original statement is given in 84.1. 

The theme of §5 is the local-to-global principle for covers: if a cover is 
defined over each Q,, does it follow it is defined over Q? The state of the 
question is recalled in §5.1. Essentially the local-to-global principle holds for 
G-covers; and for mere covers, it holds under some additional hypotheses 
on the monodromy group and is conjectured not to hold in general. We 
consider the case of mere covers that are Galois over Q. The local-to-global 
principle is known to hold then if the base space B satisfies the (Seq/Split) 
condition. Here we consider the general case, that is, we do not assume 
the (Seq/Split) condition. Our main result appears in §5.3. Our approach 
uses the notion of Galois covers given with the action of a subgroup of the 
automorphism group. These objects, called SG-covers, and which generalize 
both mere covers and G-covers, are introduced in 85.2. 


NOTATION — Given a Galois extension E/k, its Galois group is denoted by 
G(E/k). Given a field k, we denote by k, a separable closure of k and by 
G(k) the absolute Galois group G(k,/k). 


2. Structure result for models of a cover 


2.1. Mere covers and G-covers. The main topic of this paper is the 
arithmetic of covers. There are classically two situations. One is concerned 
with not necessarily Galois covers — traditionally called “mere covers” — 
while the other one considers “G-covers”, 1.e., Galois covers given with the 
Galois action. 

Given a field K, a regular projective geometrically irreducible vari- 
ety B defined over A, mere covers f : X > B over Kx correspond (via 
an equivalence of categories) to finite separable regular field extensions 
K(X)/IK(B) while G-covers of B of group G over A correspond to regu- 
lar Galois extensions A(X )/K(B) given with an isomorphism of the Galois 


group G(h'(X)/K(B)) with G. 
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We denote the variety B with the reduced ramification divisor D removed 
by B* and the K-arithmetic fundamendal group of B* by IIx(B*) or simply 
by IIx when the context is clear. Degree d mere covers of B over F, un. 
ramified over B* correspond to transitive representations UV : IIp(B*) > ae 
such that the restriction to II,(B*) is transitive. G-covers of B of group 
G over F correspond to surjective homomorphisms ® : Ip(B*) + G such 
that @(Ip,(B*)) =G. 

We freely use these notions in the sequel; see [DeDo1;§2] for more details. 
In the rest of this section we fix a field K and a variety B as above. 


2.2. Descent of the field of definition of [G-]covers. As in 
|DeDol], we frequently use the word “[G-]cover” for the phrase “mere cover 
[resp. G-cover]”. Suppose given a Galois extension F/K and a [G-]cover 
f :X — B a priori defined over F and such that the ramification divisor D 
is defined over A’. In the mere cover situation, we will always assume that 
the Galois closure over F of the mere cover is, as G-cover, defined over F. 
This insures that the group of the cover (1.e., the Galois group of the Galois 
closure) is the same over F' as over F',. This is of course not restrictive in 
the absolute situation, t.e., when F is separably closed. 

A K-model of the [G-]cover f is a [G-]cover fxr : Xx — B over K such 
that the [G-]cover over F' obtained from fx by extension of scalars from 
to F’, which we denote by fx @x F, is isomorphic to f over F. A [G-]cover 
f is said to be defined over K if it has a K-model fx. A significant problem 
is to study the descent of the field of definition of the [G-]cover f, and, more 
generally, to find its k-models for K C k C F. We introduce some notation 
that makes it possible to handle these questions simultaneously in both the 
mere cover and G-cover situations. 

Let ©: Ip(B*) - Sq [resp. ® : Ilp(B*) > G] the representation of Ip 
corresponding to the mere cover [resp. G-cover] f : X — B. In both cases 
let G denote the group of the cover. Then set 


G in the G-cover case 
N= 


Nors,G in the mere cover case 
Z(G in the G-cover case 
C = CenyG = (G) 
Cens,G in the mere cover case 


where Z(G) is the center of G and Nors,G and Ceng,G are respectively the 
normalizer and the centralizer of G in Sy. Finally regard N as a subgroup 
of Sq where d is the degree of f: in the mere cover case, an embedding 
N < Sq is given by definition; in the G-cover case, embed N = G in Sq by 
the regular representation of G. 

Then, in both the mere cover and G-cover situations, the [G-]cover f : 
X —+ B corresponds to the representation ¢ : II-(B*) -» G C N and the 
following holds [DeDol]. 
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proposition 2.1 — (a) The K-models of f correspond to the homomor- 
isms In (B*) — N that extend the homomorphism ¢: IIr(B*) > GC 

N. In particular, the [G-]cover f can be defined over the field K if and only 

tthe homomorphism : Up(B*) + GCWN estends to an homomorphism 

x(B") > N, 

(b) Two [G-Jcovers over F are tsomorphic if and only if the corresponding 

representations ~ and $' of Ip are conjugate by an element y in the group 


N, that is, $'(x) = pd(z)y™ for all x € Ir(B*) 


A representation II;,(B*) > N extending 9: Ir(B*) ~> GCN will 
be called a A-model of ¢. From (a), K-models of the representation ¢ 
correspond to A’-models of the [G-]cover f. 


2.3. Fibers of a cover. 


9.3.1. Condition (Seq/Split). Fix a Galois extension F'/K’, a divisor 
D of B with simple components defined over A and assume that the exact 
sequence of fundamental groups 


1 > Ilp(B*) > In (B*) > G(F/K) 7 1 


splits. This condition was introduced and called (Seq/Split) in [DeDol}. 
Consider the special case F = A’, and the base space B has K-rational 

points off the branch point set D. Then condition (Seq/Split) classically 

holds: indeed each unramified A’-rational point t, provides a section S;, : 


G(i) —- Te B 


2.3.2. Arithmetic action of G(F/K) ona fiber. Suppose given a degree 
d mere cover fy : Xx — B over W unramified over B* and let ox : 
IIx (B*) + Nors,G be the associated representation. Given an unramified 
K-rational point t, on B, denote the compositum of all fields of definition 
over Ky of points in the fiber f,-" (to) by Ky, 1,3 equivalently, Ky,2, is the 
compositum of all residue fields at t, of the Galois closure of the extension 


K(X)/K(B). We call the field Ay, 4, the splitting field of fy: at to. 


Proposition 2.2 [Del;Prop.2.1] — For each r € G(K), the element 
($x S:, (7) ws conjugate in Sq to the action of r on the fiber doe (bake Fur- 
thermore, the splitting field Ny,.1, of fx: att) corresponds via Galois theory 
to the homomorphism OK Sq, : G(K) > N; that is, it is the fired field in 
K, of Ker(dxS1,) and the Galois group of the extension Ks, 1,/K 1s the 
image group of OK Sty. 

aia 


On the other hand, condition (Seq/Split) does not always hold: an example in which it does 
Not is given in [DeEm]. 
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Return to the general case: let Ss : G(F/K) > II, denote a section to 
the map Iy —» G(F/K‘) (not necessarily of the form s;,). Each element 
of Il, induces a permutation of the different embeddings of the function 
field A(X.) in a separable closure (A'(B)), of A'(B). This set of embed. 
dings A(X) — (A(B)), can be viewed as the geometric generic fiber of 
the cover. By analogy with the case $ = $;,, for each Tt € G(F/IC), the 
element (¢4°8)(T) is called the arithmetic action of t on the generic fiber 
associated with the section $ [DeDol;§2.9]. Furthermore, the fixed field in 
F of Ner(¢x S$) is denoted by Is, 5 and called the splitting field of f at S. 
the Galois group of the extension Jy, 5/I is the image group of d,s. 


2.3.38. Remarks on Iv-points versus K-sections. 


(a) Assume F = J, and call A-sections on B* the sections to the map 
I,(B*) — GUy). For simplicity, assume char(A’) = 0. It is unclear how big 
is the set of A’-sections on B* that do not come from A’-rational points on 
B. This set need not be empty. Take for W anon PAC field of cohomological 
dimension < 1 (e.g. NW = Q*°): there is a smooth projective K-curve B with 
no Jy-rational points but from condition cd(’) < 1 there are [’-sections on 
Be 

(b) For some purposes, A’-sections can be as useful as W’-rational points. 
For example, assume a finite group G can be realized as the Galois group 
of a regular Galois extension E/K(B); let dx : In —» G be the associated 
representation. Assume B is a [-rational variety and WV is hilbertian. Using 
Prop.2.2, the hilbertian property can be rephrased to assert that there exists 
an unramified J’-rational point t, € P! such that the composed map Ox St, : 
G(Iv) — G is onto, thus yielding a realization of G as Galois group over 
iy. In fact any A’-section such that 48: G(IV) > G is onto would be just 
as good. So the question arises whether there is a weaker assumption on B 
than “B is Iy-rational” that guarantees that if I’ is hilbertian, there exists 
a Ix-section § such that ¢4°S : G(Iv) > G is onto. 


(c) Consider the Fried-Vo6lklein/Pop theorem: a field that is countable, 
PAC and hilbertian necessarily has a pro-free absolute Galois group of 
countable rank. At some point, the proof uses the existence of W’-rational 
points on some A’-variety. If A-rational points could be replaced in the proof 
by Jy-sections, then the assumption PAC could be replaced by “cd(A’) < 1”. 
Thus the Fried-Volklein conjecture — cd(¥) < 1 and W hilbertian implies 
G(Ix) pro-free of countable rank, for countable fields — would follow. And 
the Shafarevich conjecture —- Q?° pro-free —, which is a special case, would 
follow as well. 


2.4. Structure result under (Seq/Split). Assume that condition 
(Seq/Split) holds and let s : G(F/K) > IIx be a section to the map 
Ny —» G(F/Iv). Let f : X > B be a [G-|cover defined over F, unramified 
over B* and @: IIr(B*) + GCN be the associated representation. The 
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Jiowing notation is used below. Given two maps a, 3, the composed map 
denoted by o@ (when it exists); for maps from a set S to a group G, the 
roduct map, sending each s € S to a(s)G(s), is denoted below by a- /. 

P 


proposition 2.3 — Assume f is defined over KX. Let of.: In - N be 
the representation associated with some N-model fh. of f and set p° = o§-S. 
Then the set of all K-models Ox of @ is in one-one correspondence with the 
j-cochain set Z1(G(F/IX),C, 9°) (precisely defined in the proof). 

More precisely, the I<-models of @ are those maps én : Ux — N 
which equal d on Ip and equal 6-y° on G(F/K) (va $s), for some 6 € 
2(G(F/K),C, 9°). 

Furthermore, if S' is a section to the map In, — G(F/K), thens’ =a-s 
for some a € Z'(G(F/K), r,s) and the arithmetic action of G(F/K) on 


the generic fiber of bx associated with the section S' 1s given by 


g=oor-8-9° 


Proof. The field K is the field of moduli of the [G-|cover f relative to the 
extension F’/ Ix (since f is defined over A’). ;From [DeDol] (see Main Theo- 
rem (III)), we have the following. Let @: In — N/C be the representation 
of IIx modulo C given by the field of moduli condition [DeDol1;§2.7]. The 
K-models of f correspond in a one-one way to the liftings py : G(F/K) > N 
of the map (8S. More precisely, to each given lifting »y : G(F/K) — N cor- 
responds a A’-model of the representation ¢, namely the one that equals 
@ on IIp and equals y on G(F/K’) (via S). This A-model has the further 
property that the action y: G(F/K) > N C Sq is the arithmetic action of 
G(F/K) on the generic fiber associated with the section Ss. 

The map y?° is a lifting of @s. Therefore the set of all liftings exactly 
consists of those maps y : G(F/K) > N of the form y = @- y® where 6 is 
any element of Z1(G(F/K’),C,°), 7.e., is any map G(F'/K) + C satisfying 
the cocycle condition 


O(uv) = O(u) Av)? ™ 


Finally, let s’ be a section to the map IIx + G(F/K). Then o =s's7! 
Satisfies the cocycle condition 


o(uv) = o(u) a(v)s™ 


thus, lies in Z'(K,II,,,8). The arithmetic action on the generic fiber of 
?k associated with the section $’ is given by 


xs = ¢x(o-S)=¢0:6-9° O 
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2.5. Structure result (General Case). We fix a Galois extension 
F/IX and a divisor D of B with simple components defined over K. We no 
longer assume that condition (Seq/Split) holds. 


2.5.1. Extension of constants in the Galois closure ({DeDol1;§2.8]. Let 
fr: Xx — B be a [G-]cover over A and let ¢4 : Wx (B*) > N be the 
associated representation of II,,(B*). Consider the function field extension 
K(Xx)/K(B) associated to fic. Denote the Galois closure of the extension 
K(Xi)/K(B) by K (Xe )/[K(B ); its Galois group is the arithmetic Galois 
group of fx; denote it by G. Consider then the field K = K ey \NF. The 
extension Iv /K is called the extension of constants in the Galows closure of 
fr of f. 

Denote by A the unique homomorphism G(f’) — N/G that makes the 
following diagram commute. 


Ix (B*) ——> G(k) 


Proposition 2.4 [DeDol;Prop.2.3] — The homomorphism A corresponds 
to the extension of constants K/K in the Galois closure of the model fi 
of fr. That 1s, G(F/K) = Ker(A). The field K can also be described as 
the smallest extension k of N such that dx(Ik) C G, or, equivalently, such 
that kK (Xx) /k is a regular extension. 


The homomorphism A : G(A’) + N/G is called the constant extension 
map (in Galois closure) of fx. For G-covers, N/G = {1}, A is trivial 
and h = K: by definition, G-covers over K do not have any extension of 
constants in their Galois closure. 


Proposition 2.5 — Let f be a [G-Jcover over F and ¢ : Ur(B*) > 
GCN be the associated representation. Assume f is defined over kK. Let 
% : Tx (B*) + N be the representation associated with some K-model fj, 
of fF. 

Then the set of all K-models éx% of @ with the same constant ezten- 
sion map A as 6% is in one-one correspondence with the 1-cochain set 


Z'(G(F/K),Z(G),A). More precisely, the K-models of @ are those maps 


DEBES: Some arithmetic properties of algebraic covers 73 


‘In ~ N which are, for some 0 € Z1(G(F/K), Z(G), A), of the form 


o%, where P is the natural surjection P : Ix —» G(F/K). 


OK 
(OP): 


proof. From [DeDol], a K-model ¢% with constant extension map A 1s 
any homomorphism ¢x% : IIx — N extending ¢: Ilr > N and inducing the 
map A: G(F/K) + N/G. Furthermore, ¢x should also necessarily induce 
the representation % : Ih — N/C given by the field of moduli condition. 
Consequently 6 = ox : ( -)' has values in CM G = Z(G) and factors 
through G(F'//’). Prop.2.5 immediately follows. 0 


3. Covers with prescribed fibers 


In this section, the base space B is the projective line P! and F = Ks. 
In particular, condition (Seq/Split) holds. We retain this conclusion from 
Prop.2.3. If a K-model of a [G-]cover f (a priori defined over K’,) is known, 
then other K-models can be obtained by “twisting” by elements of a 1- 
cochain set Z'(G(’),C, —). More precisely, let t, € P!(A) be an unramified 
point and s;, the associated section G(K’) + Ix. The representations 
ox : Ik — N associated with K-models of f are completely determined 
by their restriction to IIx, (which is given) and their restriction p = 0k $1, 
to G(K). If one K-model is known that has y = y®, others are obtained 
by replacing y°? by 6-y°, for any 6 € Z}(G(K),C,y°). This arithmetical 
twist is an important ingredient of the results of this section. 


3.1. The Beckmann-Black problem. In [Be2], S. Beckmann asks 
the following question: is every finite Galois extension E/Q the specializa- 
tion of a Galois branched cover of P! which is defined over Q and has the 
same Galois group? A finite group G is said to have the lifting property 
(over Q) when the answer is “Yes” for every Galois extension E/Q of group 
G. She shows that finite abelian groups and symmetric groups have the 
lifting property. This problem has also been considered by E. Black who 
conjectured that each finite group has the lifting property over each field A 
(instead of Q) and proposed a cohomological approach. Her main result is 
that over a hilbertian field A’, a semi-direct product of a finite cyclic group 
A with a group H having the lifting property also has the lifting property if 
(||, |A]) = 1 and (char(K),|A|) = 1 [B12]. That includes the case of abelian 
€Toups and also gives new examples of groups with the lifting property, ¢-9- 
the dihedral groups D,, of order 2n when n is odd (see also [B11]). Using 
Our terminology, E. Black’s conjecture can be reformulated as follows. 


Conjecture 3.1 (E. Black) — Let K be an arbitrary field, G be a finite 
§roup and E/K be a Galois extension of group G. Then there exists 4 G- 
Cover f : Xi; > P! of group G defined over K and some unramified point 
to € P1(K) such that the splitting field extension K py jto/K of fr at to (see 
82.8.2) is K-isomorphic to E/K. 
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The main result of this section is the following one, which improves oy 
the initial results of Beckmann and Black in that the field K is here an 
arbitrary field. 


Theorem 3.2 — The Black conjecture holds if G ts an abelian groy 
and Ix is an arbitrary field. In particular, the conjecture holds if G(I‘) 
abelian (e.g. IN ts finite). 


is 


Proof. Here is our strategy. We realize G as the group of a [G-]cover 
f : X — P?! defined over AK. The splitting field extension Ky +,/K of f at 
the given point t, is some extension of A. Then we twist the A’-model ip 
such a way that the splitting field extension at t, equals the given extension 
Eyds, 

More specifically, suppose given a finite abelian group G and an extension 
E/Ix of group G. Realize it as the Galois group of a G-cover f}- : Xx — P! 
over Jy with at least one unramified point t, € P1(K’). This is easy if K 
is infinite. Indeed, take any regular Galois extension F/K(T) of group G 
(such extensions exist (e.g. [MatMa;Ch.4,Th.2.4])); there exists A-rational 
points on P?! different from the branch points of the extension F/K(T). 
However this is more difficult if A’ is a finite field, especially when the order 
of G is divisible by the characteristic of A. Nevertheless, this is possible: 
see [De3] where it is proved that each finite abelian group G can be realized 
as the Galois group of a G-cover defined over AK and unramified over each 
element of a finite subset D C P!(A’) given in advance. 

Next set S = S;,. Let y® : G(K) > G be the arithmetic action of 
G(s) on the fiber above t,. The given extension E/K corresponds to 
some surjective homomorphism y : G(/v) —-» G. For abelian G-covers, the 
set Z41(1X,C,~°) involved in Prop.2.3 is merely Hom(G(K’), G) (since here 
C = Z(G) =G)). Consequently, the cover f?- @q HK, has another K-model 
fr (as G-cover) such that the arithmetic action on the fiber above t, equals 
y: indeed take 6 = y(y?)! in Prop.2.3. This concludes the proof of the 
first part of Th.3.2. The second part readily follows. O 


3.2. Models with a totally rational fiber. In Th.3.2, the extension 
E/I< can be more generally any Galois extension of group H C G (instead 
of H = G). In particular for H = {1} we obtain that each finite abelian 
group is the Galois group a G-cover of P! over K with a totally K-rational 
unramified fiber alts): Furthermore, the point t, can be prescribed in 
advance in P!(i). This had been noticed in [Del] and [Des]. 

Recall that, given a cover fy : Xx — P! anda point tp € P(A) not a 
branch point, the fiber Tee ito) is said to be totally IK’ -rational if it consists 
only of K-rational points on X;. Below we comment on this condition; in 
particular we review some situations where the existence of a totally rational 


fiber revealed helpful. 
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9.9.1. Existence results. It is known that a finite group G is the Galois 
soup 2 G-cover of P' over WV with a totally K’-rational fiber in the following 
situations: 

e K is an arbitrary field and G is an abelian group (see above). 


e & isan ample field and Gis an arbitrary group: this result is essentially 
due to Harbater and Pop (see e.g. [DeDes]). Recall a field Iv is called ample 
if every smooth Wv-curve with at least one A’-rational point has infinitely 
many K-rational points. Algebraically closed fields, separably closed fields. 
more generally PAC fields are ample. Local fields are ample too. The fields 
Q'" [resp. Q'?] of all totally real [resp p-adic] algebraic numbers are other 
typical examples of ample fields. 


It is unclear whether this existence result holds for any field Jy and any 
group G. That would follow from a generalization of E. Black’s conjecture, 
where the extension F'/Iy would be allowed to be any Galois extension of 
group contained in the given group G. 


9.2.2. Field of definition of G-covers. A mere cover over Iv which 
is Galois over Jy, and has a totally A-rational fiber is then automatically 
defined over Jy as [G-|cover [Del]. 


3.2.38. Patching covers. D. Harbater showed that, over a complete 
valued field Jy, G-covers with a totally K-rational fiber can be patched 
and glued to provide a G-cover still defined over IV and of group the group 
generated by the groups of the initial covers. This patching and gluing result 
is an essential tool in the proof of the Regular Inverse Galois Problem over a 
complete valued field [Hal] and also in the proof of Abhyankar’s conjecture 


{Ha2]. 


3.2.4. Stable models. I proved in [Del] that a [G-|cover fy, defined over 
a Galois extension A’ of k and which has a totally N-rational unramified 
fiber, is then stable over k. That is, the field of moduli of fy relative to 
the extension A’/k equals the field of moduli of f = fx @% ky relative to 
the extension k,/k. D. Harbater and I combined this Stability Criterion 
with a Good Models result due to $. Beckmann [Bel] to prove the following 
Tesult [DeHa]. A G-cover of P! over Q, is defined over its field of moduli 
(relative to the extension Q,/Q,), except possibly if p is a bad prime, 7.e., 
if p divides the order of the group of the cover or if the branch points of 
the cover coalesce modulo p. The result actually holds in a more general 
Situation where Q, is replaced by the fraction field of a henselian discrete 
Valuation ring, with a perfect residue field of cohomological dimension < 1. 
M. Emsalem recently extended this result to mere covers [Em]. 


8.2.5. The Beckmann-Black problem. In [De2] I prove, for ample fields, 
@ mere form of the conjecture where the Galois cover is required to be 
defined over WV’ but only as mere cover. Existence of a G-cover defined over 
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KX with a totally Wv-rational unramified fiber is also a key ingredient of the 
proof. 


4. Field of moduli and extension of constants 


4.1. Ona result of M. Fried. In [Fr;Prop.2], M. Fried states that jg 
f :.X -— P! is a mere cover defined over JX with group a centerless group G 
then the arithmetic Galois group G (see §2.5.1) satisfies: GM Cens,G = {1} 
(where d is the degree of f). It seems that this statement 1s not exactly 
correct. Here is an argument showing why. Here again, we use the structure 
result (Prop.2.3) and the derived arithmetical ‘twist’ on A’-models of a cover, 

We use the function field viewpoint. Start with a regular Galois exten- 
sion E/IX(T) of group G and with a totally NW-rational fiber above some 
J\-rational unramified point t,: if Kv is an ample field, this can be achieved 
for any group G. Let oy, : In 4 G be the representation of Ix associated 
with the extension E/IN(T) and ¢ be the restriction of dx to IIx,: for 
x € Ty, and r € G(s), we have, using Prop.2.2, 


On (#8z,(T)) = Oz) 


Let C = Ceng,G be the centralizer of G in its regular representation and 
y : G(Iv) > C be any homomorphism. {From Prop.2.3, the representation 
© has a Iv-model or : Ty — Nors,G which equals ¢@ on Ix, and equals 
y on (G(K’) (via s) (since here yp € Z'(G(IX),C,1) = Hom(G(Iv),C)). Let 
E*/IK(T) be the field extension associated with the “twisted” representa- 
tion §-: this extension is still Galois over A’, but no longer over K. The 
arithmetic Galois group is the group generated by G and y(G(K)). This 
contradicts Prop.2 of [Fr] since elements in y(G(A‘)) centralize G. 

In the proof of Prop.2 of [Fr], it seems that an inaccuracy occurs when 


the author says (after display (2.6)): “Therefore F°(Y) c F°(Y°) Cc F(Y)”. 
Indeed Y° -+ X is obtained by descent from a cover Y -+ X which is defined 
only up to F-isomorphism. Thus the containments above hold only up to 
F-isomorphism (but not up to F-isomorphism). 

However it is possible to modify Fried’s statement so it holds true. The 
rectified version is the second consequence of Th.4.1 (Cor.4.3 below). 


4.2. Field of moduli {DeDo1;§2]. We recall below some basics about 
fields of moduli. Fix a Galois extension F/I’. Given a mere cover [resp. 
G-cover| f : X — B a priori defined over F, for each t € G(F/K), let 
ff’ : XT — BY” denote the corresponding conjugate [G-]cover. Consider the 
subgroup M(f) [resp. Mo(f)] of G( F/I‘) consisting of all elements Tt € 
G(F/I‘) such that the covers [resp., the G-covers] f and f* are isomorphic 
over F. Then the field of moduli of the cover f [resp., the G-cover f] (relative 
to the extension F/I’) is defined to be the fixed field F!‘P) [resp FMe())| 
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of M(f) (resp. Ma(f)] in F. The field of moduli of a [G-]cover is contained 
each field of definition k such that kh Ck C F. So it is the smallest field 
of definition provided that it 7s a field of definition. 
Assume ¢: IIp(B*) + G C N is the representation corresponding to the 
G-]cover f :X — B over F. Then Wt is the field of moduli of the [G-]cover 
relative to the extension F/I if and only if the ramification divisor D 
ig defined over Av and the following condition — called the field of moduli 
condition —, holds. 


(FMod) = For each U € II,,(B*), there exists py € N such that 


o(z") = pud(t ys" (for all x € IIp(B*)) 


4.3. Statement of the result. Let fp : ¥x — B be a mere cover 
over K and ox : In (B*) + GC Sq be the associated representation. Set 
m= fx Ox F and let f: X — B be the Galois closure over F of the mere 
cover f: X > B. Recall from §2.2 that we assume that f is defined over 
F as G-cover, in other words, that FA (.X,;) is a regular extension of F; 
this assumption is not restrictive if F = Ky, (1.e., in the absolute situation). 
The aim of this section is the following result. 


Theorem 4.1 — Let K/K be the extension of constants in the Galois 


closure of fx _and G be the arithmetic Galois group. Let Ng be the field 
of moduli of f as G-cover (relative to the extension F/K'). Then IX 1s an 
extension of Ng of degree 


[i : Ng] = [(Cens,GNn G: Z(G)| 


4.4. Consequences. In this paragraph, we suppose given a mere 
cover f : ¥ —> B over F and its Galois closure f : x B over F and let 
®: Ilp(B*) > G be the representation corresponding to f. Let Ng be the 
field of moduli of ‘i as G-cover (relative to the extension F/I). 


Corollary 4.2 — Assume that f is defined over NK and that Cens,G = 
Z(G). Then f : X — B is defined over its field of moduli Ng (as G-cover). 


Proof. Let fx be a A-model of f as mere cover. Apply Th.4.1 to get 
K = Kg. Now WN is a field of definition of the G-cover f (Prop.2.4). CO 
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Corollary 4.3 — Assume that Z(G) = {1}. Then f has a unique Ke! 
model fir, : XKg 7 B with no extension of constants in its Galois closure | 
(up to Kg-isomorphism). Furthermore the arithmetic Galois group G of 


each Iv-model fx: Xx > B of fg satisfies Gn CensG = {1}. 


§4.1 shows the last conclusion may fail if fx is a model of f that is not 
a model of fir... 


Proof. The assumption Z(G) = {1} insures that the Galois cover f jg 
defined over its field of moduli Ag [DeDol;Cor.3.2]. That is, the homomor. 
phism @ : Ilp - G extends to an homomorphism Oko : Ing 3 G. The 
representation w : I[p —> Sq associated with the mere cover f is obtained 
by composing @ with some embedding i : G Sy. The homomorphim 
w extends to an homomorphism II,,, — Sg, namely the homomorphism 
1@xq- By construction, the associated Ag-model of f has no extension of 
constants in its Galois closure. The uniqueness of such a [’g-model of f 
follows from Prop.2.5. 

Suppose given a Iy-model fx of fxg, i.e., an extension wy : Wy > S, 
of wx;,. It follows from 


wr Tlic \ = Wig UI.) CG 


aud the definition of A (82.5.1) that K C Kg. On the other hand, it isa 


field of definition of f as G-cover. Therefore Ng C IK. Whence h = Ng. 
Thus Th.4.1 yields 


Cens,GNG = Z(G) = {1} Oo 


4.5. Proof of Theorem 4.1. The field K is a field of definition of 
the G-cover f. Therefore A contains Kg which is the field of moduli of the 
G-cover 3 


Let A: G(F/I’) + N/G be the constant extension map of fx: (in Galois 
closure) where as usual N = Nors,G. From Prop.2.4, G(F/RK) = = Iver(A) 
and for each field k such that A Ck C K, the kernel of the restriction of A 
to G(F/k) is G(F/k) N G( (F/K) = G(F/E). Conclude that 


(1) [KX : k] = |A(G(F/k))| 


Denote the quotient group G- (Cens,GN G\/G by T; it is a subgroup 
of G/G. We claim that A(G(F/IvG)) CT. Indeed, since Wg is the field of 


moduli of the G-cover f, for each U € II, there exists some yy, € G such 
that 
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d(x") = yud(x)pu (r € Tp) 


where ¢:Ur- G is the restriction of éx to Ilp. Now the above formula 
also holds with yu replaced by #4 (U). Conclude that $x(U) lies in G, up 
to an element in Cens,G (which also obviously lies in G = @,(II,;)). This 

roves the claim since the map A is the map induced modulo G by $x over 
G(F/K). Formula (1) then yields 


(2) [K : Ke] < IT | 


Let k be the fixed field in F of A71(T). We claim that Ng C k. Indeed, 
let 7 € G(F/k), v.e., A(r) € LT. Pick an element U € IIx mapping to r via 
the map IIx — G(F'/fx). The element ¢%(U) can be written ¢%(U) =g-c 
with g € G and c € Cens,G. Thus we obtain 


b(z") = o(2)9 (x € Ir) 
This shows that + € G(Aq) and proves the claim. Using formula (1), we 
obtain 


(3) [K : Ke] > [K :k] = |A(A7(P))| = [TF 


which together with (2) completes the proof (the equality |A(A~?(T))| = |I| 
holds since T C G/G and that G/G is the image group of A).  O 


5. The local-to-global principle 


5.1. The problem. We retain the notation of previous sections and 
assume further that A is a number field and F = Q. Assume that the 
[G-]cover f :X — Bcan be defined over each completion A, of AK. Does it 
follow that the cover can be defined over A? We say that the local-to-global 
Principle holds when the answer is “Yes”. In his thesis, E. Dew conjectured 
that the local-to-global principle for G-covers of P! over number fields. This 
was proved in [Del] except possibly for number fields that are exceptions 
to Grunwald’s theorem (the field Q is not exceptional). This result was 
extended to G-covers of a general base space B in [DeDol]. The case of 
Mere covers was then considered in [DeDo2]: the local-to-global principle 
Was shown to hold under some additional assumptions on the group G of 
the cover and the monodromy representation G+ Sq (with d = deg(f)). 
Here we will consider the special case of mere covers that are Galois over 
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Recall that the field of moduli of a cover embeds in each field of definition 
Therefore if a cover is defined over each K,, then its field of moduli js k 
(relative to the extension A/A’). Hence if the field of moduli is a field of 
definition, then the local-to-global principle holds. From {[DeDo}] that ig 
the case if the cover is Galois over Q and condition (Seq/Split) (see §2.3.1) 
holds. The goal of this section is to investigate the problem when the Covey 
is Galois over Q but condition (Seq/Split) is not assumed. 


5.2. SG-covers. Our treatment uses SG-covers which are more genera] 
objects than [G-|covers. They were originally introduced in [DeDol;Fina} 
Note] (under a different name). Here we will only consider Galois SG-covers 
Given a group G anda subgroupI C G, a Galois SG-cover of fized subgroup 
I’ c G over Iv is a mere cover f : X — B over K which is Galois over 
and is given with an isomorphism G ~ G(A’'(X)/A(B)) and an embedding 
PD <> Aut(A(X)/A(B)) such that the following square diagram commutes 


G(A(X)/K(B)) 


Aut(K(X)/K(B)) 


For = {1}, SG-covers are mere covers; for ! = G, SG-covers are G-covers. 
A SG-cover f : X — B over K is defined over K if the mere cover f together 
with the automorphisms in I can be defined over K. 

From {[DeDol;Final Note], isomorphism classes of SG-covers of fixed sub- 
group [Cc G f : X + B over K correspond to surjective homomorphisms 
@ : Il;¢-(B*) > G regarded modulo conjugation by elements of the group 


(1) N = Nors,GN Cens, (I*) 


where 


- the embedding G C Sq is given by the (free transitive) action of G on the 
d conjugates of a primitive element of the extension A(X)/K(B), and 

- I* is the image of [I via the classical anti-isomorphism * : G > Ceng,G: 
identify each g € G with the element in Sg induced by the left-multiplication 
by g; then the map * send g on the right-multiplication by g. 


For I’ = {1} (mere cover case), we have N = Nors,(G); for T = G (G-cover 
case), we have N = G. Thus in general, we have G C N C Norg,(G). 
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As in both the mere cover and G-cover situations, we have the following 


assertions (which generalize Prop.2.1). The group N is the one given in (1). 


2) (a) A SG-cover f : X > B of fited subgroup T C G over K, corresponds 
to a representation ¢: Ilx,(B*) *+GCN 
(6) The K-models of f (as SG-cover) correspond to the homomorphisms 
Ax(B*) + N that extend the homomorphism ¢ : Ix,(B*) ~ G. In 
rticular, the SG-cover f can be defined over the field K if and only if 
the homomorphism © : Il,,(B*) + GCN extends to an homomorphism 
fix(B*) > N, 
(c) Two [G-Jcovers over K, are isomorphic if and only if the correspond- 
ing representations @ and ¢' are conjugate by an element y in the group N, 


that is, ¢'(z) = 9O(x)ye~* for all x € Ix, (B*) 


The map * can be described more intrinsecally. Denote by Gy the group 


Gy =G =< Aut(G) 


The group G4 is isomorphic to Nors,(G) (e.g. [DeDol;Prop.3.1]). Next use 
the embedding G + Gi, sending g to (g,1) to identify G with a subgroup 
of G, (still denoted by G). For each g € G, denote the conjugation by g by 
€y (¢g(h) = ghg~'). Then the anti-isomorphism * is the map 


G-> Gy 
g > 9° =(9,¢5") 


Some groups play a central role in [DeDol] and [DeDo2]: they are N 
(defined in (1)), C = Cenn(G), N/C and CG/G~C/Z(G) ?. 


Proposition 5.1 — Denote the subgroup of Aut(G) of all automorphisms 
X that are trivial on T by Autr(G). Then we have. 


(a) N = {(y,x) € Gily € G,y € Autyp(G)}. 

(6) N/G = Autp(G). 

(c) C = (Ceng(L))*. 

(d) CG/G is isomorphic to Ceng(P)/Z(G), which embeds in Autr(G). 

(e) Z(G) is a direct summand of C if and only if Z(G) is a direct summand 
of Ceng(T). 

(f) Z(G) C Z(N) if and only if for each yx € Autr(G), we have x|z(G) = 1. 


Proof. By definition, N = G4 Ceng,(I*) = Cene, (I*) and C = 
Ceny(G); (a) and (c) follow straightforwardly. The projection Gy — 
gg Ee ce ce 

2(G) Should a priori be replaced by CNG, but CNG=Z(G) in general. 
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Aut(G) mapping each (u,v) € Gy to x € Aut(G) factors through G4/G 
to yield the isomorphism G4/G ~ Aut(G); (b) and (d) follow immediately, 
The anti-isomorphism * sends Ceng(I’) onto (Ceng(I))” and Z(G) onto 
itself. This provides an isomorphism Ceng(I)/Z(G) ~ (Ceng(P))” /Z(G) 
and proves (e). Finally (f) readily follows from (a) and the definitions. 0 


5.3. The local-to-global principle for Galois covers. [DeDog| 
studies the local-to-global problem for mere covers, 1.e., SG-covers of sub- 
group T = {1}. But the paper was written in a more general setting. 
Namely the objects we considered were representations ¢: In, -* GCN, 
In [DeDo2] we were mainly interested in mere covers, which correspond to 
N = Norg,(G), and G-covers, which correspond to N = G. But from (2) 
above, [DeDo2] applies more generally to SG-covers provided that N is un- 
derstood as in (1), C as Ceny(G). In this context, the second Theorem in 
§1 of [DeDo2] rewrites as follows (using Prop.5.1). 


Theorem 5.2 — Assume that X = Q, or more generally, that K 1s a 
number field for which the special case of Grunwald’s theorem does not occur. 
Then the local-to-global principle holds for SG-covers of fized subgroup TC 
G satisfying simultaneously the five conditions below. 


(i) For each x € Autr(G), we have x|zg) = 1, 

(ii) Z(G) ts a direct summand of Ceng(l) 

(1/1) Z(Ceng(T)/Z(G)) is a direct summand of Ceng(l)/Z(G). 
(1i/2) Z(Ceng(T)/Z(G)) Cc Z(Autr(G)). 

(iti/s) Inn(Ceng(T)/Z(G)) has a complement in Aut(Ceng(T)/Z(G)). 
These five conditions hold for example if Ceng(T) = Z(G) CT. 


The local-to-global principle also holds for SG-covers satisfying simultane- 
ously conditions (1), (11) above and the following condition 


(iii)’ Ceng(T)/Z(G) has a complement in Autr(G). 


Originally we were interested by the local-to-global principle for Galois 
mere covers. Th.5.2 above can be used as follows. Suppose given a Galois 
mere cover defined over Q, for each p. If in addition there is a subgroup 
I Cc G such that for each p, the mere cover has a model defined over Q) 
along with the automorphisms in I, then f satisfies the local assumption 
of the local-to-global principle not only as mere cover but also as SG-cover 
and Th.5.2 may be used. The bigger I the stronger the hypothesis. If 
I = G, 1e., is as big as can be, then the hypothesis does insure that the 
mere cover is defined over Q (it is even defined over Q as G-cover). The 
weakest hypothesis is for T = {1}. Although no example is known, it is 
most likely that the local-to-global principle does not hold in general. 
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We state some intermediate special cases. In both statements below, 
= Q, or more generally, J. is a number field for which the special case of 
Grunwald’s theorem does not occur. 


Corollary 5.3 — Let T be a subgroup of a group G such that Ceng(T) C 
TaZz(G) (e.g. Cena(l) = {1}). Then the local-to-global principle holds for 
all SG-covers with fized subgroup CG. 


Corollary 5.4 — Let T be a subgroup of a group G such that Z(G) CT, 
Z(G) is a direct summand of Ceng(T) and Ceng(T)/Z(G) has a comple- 
ment in Autr(G). Then the local-to-global principle holds for all SG-covers 
with fized subgroup TCG. 


Indeed, assumptions in Cor.5.3 guarantee that Ceng([) = Z(G) CT. 
Note that if they hold, then necessarily Z(G) = Ceng(T) = Z(T). As to 
Cor.5.4, the assumptions guarantee here that conditions (i), (ii) and (iii)’ 
of Th.5.2 hold. Assumptions of both Cor.5.3 and Cor.5.4 hold if T = G. 
i.e., in the G-cover situation. Thus both these results generalize Th.3.8 of 


{[DeDol). 
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Curves with infinite K-rational 
geometric fundamental group 


Gerhard Frey, Ernst Kani and Helmut Volklein 


1 Rational Geometric Fundamental Groups 


Let K be a finitely generated field with separable closure K, and absolute 
Galois group Gx. By a curve C/K we always understand a smooth geomet- 
rically irreducible projective curve. Let F(C) be its function field and let 
II(C) be the Galois group of the maximal unramified extension of F'(C’). We 
have the exact sequence 


1+ H,(C) ~ W(C) + Gx - 1 (x) 


where II,(C’) is the geometric (profinite) fundamental group of C x Spec(K,) 
(i.e. II,(C’) is equal to the Galois group of the maximal unramified extension 
of F(C) @ K,). 

This sequence induces a homomorphism pc from Gx to Out(II,(C)) which 
is the group of automorphisms modulo inner automorphisms of I,(C). It 
is well known that po is an important tool for studying C’. For instance, it 
determines C up to A —isomorphisms if the genus of C is at least 2 and K 
is a number field or even a p—adic field (see [Mo]). 


So it is of interest to find quotients [(C) of II(C) such that the induced map 
of Gx is not the identity but the induced representation po becomes trivial. 
We give a geometric interpretation of such quotients. For this we assume 
that the sequence (x) is split and choose a section s which induces a homo- 
Morphism o from Gx to Aut(II,(C)). Let U be a normal subgroup of II(C) 
contained in II,(C). The representation pc becomes trivial modulo U if and 
only if the map o mod U has its image inside of Inn(II,(C)/U). 

Let Z be the center of I1,(C)/U and fl = (II,(C)/U)/Z. Our condition on 
U implies that there is exactly one group theoretical section § from Gx to 
(I(C)/U)/Z inducing the trivial action on II and so I] occurs as Galois group 
of an unramified regular extension of F(C) in a natural way. 


Thus, to find center free infinite factors of Il, on which pc becomes trivial 
IS equivalent with finding infinite regular Galois coverings of C. Choosing 
as base point a geometric point of C we can say that “II is a factor of the 
&€ometric fundamental group of C over K”. 


Remark 1.1 If s can be extended to a (group theoretical) section into 
I(C)/u then again it induces the trivial action on I1,(C)/U and hence this 
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group occurs as Galois group of a regular unramified extension of F(C). This 
is always true if K is a finite field or if I1/U is abelian. 


But there is a difficulty: The sections s corresponding to different quotients 
of II, may not be compatible ( i.e. the composite of the corresponding 
coverings are not regular) and so we do not have a “maximal unramifieg 
regular covering” of C over Kk. 


So we specialize, assume that C has a K-—rational point P and choose a 
splitting sp of (x) corresponding to P by identifying Gx with the decompo. 
sition group of an extension of the place corresponding to P in F(C) to its 
separable closure . Now the finite quotients of II,(C’) on which sp(Gx:) op- 
erates trivially correspond to unramified Galois coverings C’ of C on which 
the decomposition group of P operates trivially. Hence P has K-rational 
extensions to C” and the choice of sp corresponds to the choice of such an 
extension P’. (If we make another choice, then the corresponding section is 
replaced by a conjugate in II(C).) We can regard C’ as etale covering of C 
with respect to the base points P resp. P’. 

Taking the limit we get the A-rational geometric fundamental group 
of C with base point P : 


(C, P) := 11,(C)/((sp(o) — 1)Mg(C)) we: 


Remark 1.2 The construction of unramified coverings with base points is 
also of interest for coding theory: 
Assume that g(C) > 1. Since 


g(C’) —1=[C’: C](gc — 1) 
and #{P’ € C’(K’)} divisible by [C’: C] we get (if C’(K) # 0) that 
OK) 


1 


QO") =—1 ~ (ge — 1) 


and this means that C’ lies in a “good family” of coverings and so it is 
desirable to have infinite towers in this restricted sense over curves C' (of 
small genus). 


If A is a finite field such towers can be constructed by using class field theory 
of curves over any finite field (cf. Serre[Sel]) or over fields of square order 
by using Shimura curves (cf. [Ih]). By Ihara’s method one can even obtain 
curves of genus 2 which are optimal. The curve given by y? = 2® — 1 is such 
an example. 

All these known examples are defined over finite fields and are of very special 
type. 
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‘bb this paper we shall use either quartic coverings of the projective line or 
yadratic coverings of elliptic curves with enough ramification points (in- 
stead of quadratic coverings of the projective line as in the class field tower 
constructions) to find for every genus g > 3 curves with infinite geometric 
fundamental group defined over Q(z) or over F,(z) (where 7 is a forth root of 
ynity) and we find even parametric families of such curves over every ground 


feld containing 2 (cf. Theorem 5.22). ? 


More precisely we get the 


Result: Let K be a field of characteristic # 2,3 which contains a fourth root 
of unity. Tike = ia withbe k*, #1. 
Let C be a cyclic covering of degree 4 of the projective line given by an 
equation 

st = (1+ b*)t(t? — 1)(t — a)g(t) 
with a polynomial g(t) such that g(0)g(1)g(-1) #0 and g(a) = 1. Then C 
has an infinite K-rational geometric fundamental group with base point P, 
corresponding to the place t =a. 


We apply this and get the 


Consequence: Let 6 and a be as above. 
Let Ey be the elliptic curve defined over K by the equation 


Ey : y* = (1+-b4)a(2*—-1)(x—a). 


Then for every natural number g > 3 there exist quadratic coverings C of 
E, defined over K of genus go = g with infinite K -rational geometric funda- 
mental group with base point lying over P,, the point of Ey corresponding to 
t=a., 


Example: Let K, 6 and a be as above. The curve 
Ci: y* = (14 b*)2(2? — 1)(2 — a) 
has an infinite K-rational geometric fundamental group with base point (a, 0). 


We thus have examples of curves of genus g with an infinite K-rational ge- 
Ometric fundamental group for every g > 3. No such example can exist for 
curves of genus g = 1, as will be explained in the next section. This leaves 
only the case g = 2. Here (cf. Example 4.13) we shall find very special curves 
with an infinite tower of regular unramified Galois coverings but we cannot 
decide there whether there are rational points in the tower. 


There are other interesting aspects related to curves of genus 2 in the context 
of the questions discussed in this paper which will be investigated in more 
detail in the paper [FKV)]. 


‘— 


“We remark that we do not have any example of a curve defined over Q with infinite 
Qrational geometric fundamental group. 
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2 Abelian coverings 


In this section we shall give a short review of the special case of abelian 
unramified coverings. 

The largest abelian factor of II,(C’) which appears as the Galois group of q 
regular unramified extension of F'(C) is II,(C)*/((sp(o) — 1)T4(C)") cea, 
and so it is independent of the choice of P or of the choice of the splitting, 
(But the coverings will depend on the choices.) Moreover, one sees that every 
abelian group which can be realized as Galois group of a regular unramified 
covering of C can be realized in such a way that a given point P € C(K) 
splits completely (cf. [V] or [Se]). So we do not have to distinguish between 
extensions with or without K-rational base points as long as we only want 
to describe the occurring Galois groups. 


Let us begin by looking at elliptic curves. Here we have the following finite. 
ness result: 


Theorem 2.1 Let K be a finitely generated field. Then there is a num- 
ber N = N(K) such that for all elliptic curves E defined over K we have 
IE (1 ) tor | < N. 


Proof. We use induction with respect to the transcendence degree of K over 
its prime field Ko. 

If K is a finite field the claim is obvious. If K is a number field the theorem 
of Merel gives a bound N depending on [K : Q] only (cf. [Me]). 

Let K be transcendental of degree dy > 1 over Ko. We fix a subfield K, C K 
such that J, is finitely generated over Ko and such that K is a function field 
of one variable over Ay of genus gx. 

Let E/K be an elliptic curve with a K-rational point P of order n. Let Ky 
be the smallest extension field of Ay in K over which E is defined and over 
which P is rational. 

If 2 is equal to Ky we can use the induction hypothesis, otherwise K, is a 
function field over Ky with a K,-rational embedding of the function field of 
the modular curve X,(n) into K2 and hence into K. 

Thus, gx bounds the genus of X;(n) which is of size O(n?) (cf. Miyake[Mi], 
if (n, char(A’)) = 1 and Igusa{Ig], for n = @", € = char(K)), and so n is 
bounded. 


Corollary 2.2 There is a universal bound n = n(K) such that for all regular 
unramified Galois covers ¢ of elliptic curves E over K we have deg(¢) < n. 


Proof. If ¢ is an unramified Galois covering of E, then the covering curve 
E" is an elliptic curve too, ¢ is an isogeny and the Galois group Gx acts on 
the kernel of ¢ trivially, i.e. Ker(¢) < E’(K). Using the universal bound for 
torsion points obtained in the theorem the corollary follows. 
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we consider the case that the genus of C is larger than 1, and study as 
pefore the existence of regular unramified abelian Galois coverings of C. 


proposition 2.3 Ifo: Ci — C is a regular abelian unramified covering, 
then deg ? is bounded by a number depending on K and on C. 


proof. By Serre [Se] we know that there is an abelian variety A/ A’ such that 


¢ induces an isogeny 
oO: Aa Jo 


with Ker(¢.) C A(A) and |Ker(¢,.)| = deg(¢). If K is finite we have 
|A(K)| = |Jc(K)|, and so we are done. If K is a number field or a field 
of transcendence degree larger than 1 over its prime field we can use reduc- 
tion theory to get the result. 


Remark: Since at present we do not have an analogue of Merel’s theorem 
for abelian varieties, we can only find a bound depending on the genus and 
on the conductor of C, and not only on K, even if K is a number field. 


3 Coverings of P! with given ramification type 


3.1 The Construction Principle 


In the last section we had seen that for a given curve C over a finitely 
generated field A’, the degree of the abelian unramified regular extensions of 
C/K is bounded. 

The situation changes if we look for non-abelian coverings. Indeed, in the 
next section we shall give examples of families of curves of arbitrary genus 
9 = 3 defined over a finitely generated field AK which have infinite towers of 
unramified regular Galois coverings. However, it should be remarked that 
the dimension of these families is small compared with the dimension of 
the moduli space of curves of genus g and so it can be conjectured that 
for “general curves”, the degree of a regular unramified Galois covering is 


bounded by a number N(g, K). 


The construction of these curves is based on the following principle. We 
begin with the projective line P' and choose a finite Gx-invariant set So 
of points of P'. Then we look for Galois coverings OC’ of P’ with a simple 
Galois group G (in our examples G is a projective linear group). We require 
that this covering is unramified outside of Sp and that the ramification in 
Points P, € S, has an order ep, which is small and prime to char(/). We 
take a Galois covering Co of P! disjoint to C’ which ramifies in all P; with 
@n order which is a multiple of ep. By Abhyankar’s Lemma it follows that 
C’ xp Co/Cp is unramified with Galois group G and, because of the simplicity 
of G, regular. 


90 FREY, KANI & VOLKLEIN: Curves with infinite fundamental groy, 


To find C’ we follow [FV] and use Hurwitz spaces which are moduli spaces for 
the ramification cycle configuration inside of G; this boils down to proy; 
the existence of K-rational points on these spaces. This Diophantine Problep, . 
leads to interesting relations between arithmetic geometry and group theory : 
Sometimes group theory or, to be more precise, the theory of rigidity IMplieg 
the existence of rational points on the corresponding Hurwitz spaces. Thig 
is the case in our examples which are related to Thompson tuples, and $0 
in this case the existence of such coverings is obtained by pure group theory 
(and Riemann’s existence theorem). 

However, this method fails to produce K’-rational points on these coverings 
and so arithmetic-geometric methods have to be invoked in order to Prove 
the existence of such points (which is necessary for our considerations of 
K’-rational fundamental groups). 

In the case of the Thompson tuples, such a method is readily available, for 
behind their very construction (cf. [V3]) lies a deep arithmetic reason, viz. 
the existence of a family of abelian varieties with good reduction. 


3.2 Rigidity 


For the convenience of the reader we shall now give a very short overview over 
rigid systems of conjugacy classes. We shall assume that char(K) = 0, but 
actually it is enough to avoid fields with characteristics dividing the order of 
the occurring Galois groups and we shall use this fact in our examples. 

We want to describe the regular Galois coverings of the projective line. For 
this, we recall the situation over the complex numbers over which we can use 
as crucial tool the Riemann existence theorem. 


We have the following invariants for a finite Galois extension L/C(z): its 
Galois group G, the finite set P = {P,,..., P,} of branch points (which lie in 
P'(C) = CU {oo}) and for each branch point P; the associated conjugacy 
class C; of G consisting of the distinguished inertia group generators over 
P;. Define the ramification type of L/C(zx) to be the class of the triple 
(G,P,C), where C = (C,...,C,). We say that two such triples are in the 
same class if the sets P are the same and if there is an isomorphism between 
the groups that identifies the conjugacy classes associated with the same 
point of P. 


The extension L/C(z) is defined over a subfield K of C if there is a Galois 
extension Lx /K(xr), regular over A’, which becomes equivalent to L when 
tensored with C. 


C is weakly rigid if any two generating systems 0),...,0, of G with o; € Ci 
and o,---o, = 1 are conjugate under an automorphism of G and if there is 
one such generating system. By Riemann’s existence theorem, C is weakly 
rigid if and only if there is exactly one Galois extension of C(x) of type 
(G,P,C) (up to equivalence), see [V], Th. 2.17. This uniqueness can be 
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Joited to define the covering of P’ over a smaller field. To do this, we 
parpen the group theoretical conditions for the ramification type as follows. 
‘$s 


C is quasi-rigid if it is weakly rigid and every automorphism of G fixing 
each Ci is inner. 

Now suppose C is quasi-rigid and that L/C(x) is a corresponding extension. 
pet L’ be the fixed field of the center Z(G) of G. Then the extension L’/C(x) 
is defined over the field generated over Q by all branch points and by all roots 
of unity of order ord(a), 0 € C,U...UC,. The exact minimal field of definition 
of L' may be even smaller and can be determined using the branch cycle 
argument (see [V], 3.2 and 3.3.2). This construction principle for Galois 
extensions of rational function fields (“rigidity criterion”) has successfully 
been used to realize given groups as Galois groups. 


4 Construction of infinite towers of unrami- 
fied curve covers 


4.1 Thompson tuples and Belyi triples 


Let q be a power of the prime p, and n > 3. We denote by F; the multiplica- 
tive group of the field F, with q elements. For any o € GL,(q), let ¢1,...,¢n 
be the eigenvalues of o (counted with multiplicities) and y,(T) = [1,(7 - «) 
its characteristic polynomial (normalized to be monic). We call o a per- 
spectivity if it has an eigenspace of dimension n — 1. 


Definition 4.1 Let r = n +1 (respectively, r = 3). A Thompson tuple 
(respectively, a Belyi triple) in GL,(q) is an r-tuple (o1,...,0,) such that 
%},...,0, generate an irreducible subgroup of GL,,(q), their product satisfies 
%:--o, = 1 and og; is a perspectivity for all 7 = 1,...,r (respectively, for 


t= 3). 
In [V1] one finds 


Theorem 4.2 Suppose that (a1,...,0,) and (o},...,0.) are Thompson tuples 
(respectively, Belyt triples) in GLn(q) with Xo, = Xo" for alli. Then there ts 
an element g € GL,(q) with of = gu o,g for alli. Thus oj,...,0, are weakly 
rigid generators of G = (o1,...,0,), and they are quasi-rigid generators if the 


Rormalizer of G in GLn(q) equals F*G. 


We can apply the theory of rigid ramification types presented in the last 
Section to obtain (cf. [V], Theorems 2.17, 3.26 and 7.9): 
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Corollary 4.3 Let (oj,...,0-) be a Thompson tuple or a Belyi triple in 
GL,(q). Let C; be the conjugacy class of o; in G = (04,...,0r), and ley 
PriyioyP, Ge distinct points of P*(C).. Put.BP— {Pis.4. 2} Car = Cs and 
C =(Cp)pep. Then we have: 


(a) There is a unique Galois extension L'/C(x) of ramification type [G, P, Ci, 


(6) Suppose that the normalizer of G equals FG. If L = Fix(Z(G)) denotes 
the fized field of the center Z(G) of G, then the extension L/C(x) is defined 
over any subfield kK C C which contains all the (finite) P, € P. and all roots 
of unity of order ord(a;). In particular, there is a regular Galois extension 
Ig of K(x) with Galois group G = G/Z(G) with ramification type |G, P, CI, 
where C'p is the image of Cp inG. 


Corollary 4.4 Suppose in addition that M is a regular extension of K(x) 
which ts linearly disjoint to the above extension Lg over K, and which has 
the property that for eachi = 1,...,r and each place m of M lying above the 
place corresponding to P, of C(x), the ramification index of m is a multiple 
of ord(a;). Then the composite Lj. M is an unramified Galois extension of 
the function field M and its Galois group is isomorphic to G. 


We therefore see that to find such field extensions L, it is enough to find 
Thompson tuples or Belyi triples. These will now be investigated in more 
detail. The first result in this direction (taken from [V1]) shows that they 
can be characterized by their characteristic polynomials: 


Theorem 4.5 Let fy,..., f, € F,[T] be monic polynomials of degree n with 
IT; f(0) = (-1)". 

(a) There exists a Belyi triple (01, 02,03) in GLn(q) with yo, = f; if and only 
if fa(T) = (T — c)""*(T — d) for elements c,d € FY satisfying abc # 1 for all 
a,b F, with f(a) = fo(b) = 0. 


(b) There exists a Thompson tuple (o1,...,0,) in GLa(q) with yo, = fi of 
and only if f(T) = (T —a,;)"""(T — 6) for elements ay,...,a,, ,...,b, € F; 
satisfying a,---a, #1 and b, ]],,,4; #1 for allj =1,...,r. 


From Theorem 4.2 we see that the tuples (o1,...,0,) are classified by their 
characteristic polynomials y,,, and so the group G = (0j,...,0,) depends 
only on the y,, (up to conjugation). In the case of Thompson tuples there 
is a complete classification of the related groups (cf. [V1] and [V4]). In the 
following Theorem 4.6 we shall state only that part of this classification which 
will be relevant for us below. 

In order to state the result, we require the following terminology. A Thomp- 
son tuple (04,...,0n41) is called normalized if y., = (T —1)"71(T — },) for 
an ee fF 
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peorem 4.6 Let n be odd and > 3. Suppose (01,...,On41) ts a normal- 
‘eed Thompson tuple in GL,(q) generating the group G with x,,(T) = (T —- 

n-1(T — b;) for certain 6; EF, fori =1,---,n. [fn =3 assume that some 
hj has multiplicative order > 3. Suppose that F, = Fp(an41,61,-++On41), and 
qssume that if q = 9 is a square, then not all the a,, b; have norm 1 over 


Foo: Then 
; SLi(q) < G< GL,(q). 


4.2 Curves with infinite towers of unramified regular 
Galois extensions 


We now apply the above results to construct unramified Galois G-covers of 
curves with group G = PSL, (p) for suitable values of n and p. In order to 
be able to specify precisely which values are suitable here, we first introduce 
the following notation. 


Notation. Let n > 3 be an odd natural number, and let d),...,d,4, > 3 be 
integers. Then we denote by P(n,d;,---,dn41) the set of all prime numbers 
p satisfying the following conditions: 


(i) (n,p— 1) = 1; 


(ii) there are elements }),...,On41 € F* with 6) --+ 6,4; = 1 such that each 
b; has multiplicative order d; in Fe 


A trivial but useful fact is 


Lemma 4.7 /f d,,...,d, are prime to n and dn; = lem(d,,...,d,), then 
every prime p with p $1 mod | for all primes 1 | n and p = 1 mod dns 
ts in P(n,dy,---,dn41). Moreover, there is a constant c (depending only on 
nand dy,...,dn4,) such that for all primes | > c there are infinitely many 
PE P(n,d,,--+,dn41) such that | does not divide the order of GL,(p). 


From now on, suppose that the integers d,,...,d,41 satisfy the conditions of 
this lemma, and that K is a field of characteristic 0 containing a primitive 
d" root of unity, where d = Icm(d),...,dn41,2) = lem(dn41,2). Moreover, 
let pe P(n,d1,+++,dn41) and choose b; € F* such that condition (ii) holds. 
Then by Theorem 4.5 there is a normalized Thompson tuple (01,..., 41) 1m 
GL, (p) corresponding to these 6; with a,4; = —1. The order of 0; equals d; 
or? <n and it equals Icm(d,41,2) =d for2 =n +1. 

By Theorem 4.6 the group G = (0},...,0n41) contains SL,(p). Thus the 
Stroup G = G/Z(G) is isomorphic to the simple group PSL,,(p) (because we 
assumed (n,p—1)=1 ). 

t remains to choose the ramification points. 

Since the automorphism group of P! acts sharply triple transitive we can 
assume without loss of generality that P,.; = 0, P, = 1 and Pa4i = ©. The 
Other ramification points are then denoted by t),:+-,tn-2 € A. 
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Using Corollary 4.3 we see that for every p € P(n,di,---,dn4i), we get a 
regular Galois extension Fy of A(X) with Galois group PSL,(p) ramified 
only in 0,1, 00, ),...,t,-2 with ramification indices dividing d. 

Since for different primes p the Galois groups of the F”’s are non-isomorphj¢ 
simple groups, the composite field 


Vv s Ald 
EBlidi juliet = li 7 


pEP(n,di ,-,dn41) 
has the same regularity and ramification properties. 


Next, let MM, be any regular extension field of K(X) such that every extension 
of the places 0, 1,00, t,---,t,~2 is ramified with an order divisible by d. (For 
instance, take M, as a cyclic cover of K(x) of degree d totally ramified at 
0,1, 00,t,:++,t,-2. In this case the genus of M, is equal to (n—1)(d—1)/2.) 
Using Corollary 4.4, we get that Fad tntiigs) - My is a regular unramified 
extension of M, with Galois group equal to 


[I] PSL n(p). 


peP(n,di,--.dn41) 


Thus, if we take KH = Q(Cyg,ti,...,tn-2), where the ¢,,...,t,-2 are alge 
braically independent over Q, then we obtain: 


Theorem 4.8 Let n > 3 be an odd number and let d,,--+,d, > 3 be in- 
tegers prime to n, dyy, := lem(dy,---,d,) and d = lem(2,dn41). Then 
P(n,dy,-++,dn41) is an infinite set and there is an (n — 2)-dimensional ra- 
tional family C of curves of genus (n —1)(d —1)/2 defined over Q(q) such 
that for all p EP(n, d;,+++,dn41) there is an unramified regular Galois cover 
C, with Galois group PSL, (p) and the composite of these covers is an infinite 
unramified regular Galois tower. 


On the other hand, if we choose the #,’s to be algebraic over Q, we obtain: 


Theorem 4.9 Let A be a number field. Let n be odd and at least equal to 3 
and let d be as in the previous theorem. Then there are infinitely many curves 
of genus (d—1)(n — 1)/2 defined over K with infinite towers of unramified 
regular Galois covers defined over K(Cq). 


As was mentioned above, the Hurwitz space theory and the theory of Thomp- 
son tuples works also over fields of characteristic | as long as / is prime to 
the order of the Galois group. Using the second part of Lemma 4.7 we see 
that Theorem 4.8 holds if we replace Q by F; for almost all primes /. Hence 
we have 
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sprollary 4.10 Let n and d be as above, and let | be a sufficiently large 
F rime such that there are infinitely many primes p with p'—1 prime tol for 
a 4 i, 7. Then there exist curves of genus (d —1){n —1)/2 defined over 
K =F;j(Ca) which have an infinite tower of unramified regular Galois covers 


defined over Kk. 


Now we discuss the simplest possibility and get the following example which 
will play an important tole in Section 5.3. 


Example 4.11 Letn =3and dj =... =dy=d=4, and let K bea field of 
characteristic > 5 which contains a fourth root of unity ¢, = 7. Furthermore, 
tet t € K be such that t # 0,1. 

Then for every p = 5 mod 12 we get an extension of P! defined over K 
whose Galois group is equal to PSL3(p) and which is ramified of order 4 in 
{0, 1,00, t}. 

Fix an integer go > 1. If go = 1, let Cy be an elliptic curve which covers P’ 
with ramification points {0,1,00,t}, and if gg > 1, let Co be a hyperelliptic 
curve of genus gg whose set of Weierstrass points contains {0,1,00,t}. Let 
C be a quadratic extension of Cy ramified in the extensions of {0,1, 00, t}. 
Then C has genus g := go + 6/2, where 6 is the number of ramified primes in 
the cover C/Co. 

For example, if we choose C as a cyclic cover of degree 4 over P! which is 
ramified in the Weierstrass points of Cg with ramification order 4 and in s 
additional points of P’ with ramification order 2, then g = 399 + s. 


Hence we get 


Corollary 4.12 Let WK be a field of characteristic # 2,3 which contains a 
primitive fourth root of unity: = (4. Lett € K, t #0,1 and let Co be either 
an elliptic curve (go = 1) or a hyperelliptic curve of genus go defined over 
K with at least four K-rational points of order 2 respectively, Weierstrass 
points equal to {0, 1,00, t}. 

Then for all non-negative integers s there are quadratic covers of Co defined 
over IK of genus 3go +s with an infinite tower of regular unramified Galois 
covers over Kk. 


If we want to obtain examples as above for curves of genus 2, then we have 
to use Belyi triples, but in that case we find only isolated examples. Here is 
ne of them: 


Example 4.13 Let ®,, be the cyclotomic polynomial of order m, and let 
(01, 02,03) be a Belyi triple in GL3(p) with characteristic polynomials y~, = 
(T + 1), yo, = (T — 1), and yo, = (7 —1)?(T +1). Then the images 


1,02,3 generate the simple group PSL3(p) for p = —1 mod 3, and there is 
@ corresponding extension L,/Q(z) branched at 0,00,1. Take x = (sand Jet 
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M/Q(t) be the genus 2 function field extension branched where t is a Sixth 
root of 1, i.e. M corresponds to the curve with equation 


y= t§- 1, 


Then L, -M/M is unramified with group PSLa(p) and defined over Q. j; 
follows that the curve y? = t® — 1 has an infinite tower of regular unramifieq 
Galois extensions over Q and over F; for all primes / > 5. 


We observe that although we obtain in this way the example of Ihara mep. 
tioned in the first section, we have actually realized different Galois groups by 
this method. At present, however, we cannot answer the question concerning | 
the existence of rational points in the associated infinite tower. 


Further examples (for PSL3(p) again) are obtained by taking y,, as above, 
and x, = (T — 1)®3, ye, = (FT + 1)®4, or alternatively, yo, = (T + 1)4,, 
Xo, = (T —1)®,4. We thank G. Malle for pointing out these examples. 


4.3 Geometric Interpretation of Thompson tuples 


In the last section, the examples of almost unramified covers of P! were 
constructed by using group theory. This construction did not provide any 
method for determining the existence of rational points in the covers, nor did 
it explain the arithmetical “reason” behind the existence of such examples. 
However, both these aspects are greatly elucidated by using the results of 
[V3] to relate the Thompson tuples to torsion points of Jacobians of curves 
which are cyclic covers of the projective line. 


The basic idea here is the following. If ¢ : C — P is a cyclic covering 
with group Z = Aut(@) defined over a field K and \: Z — F° is a faithful 
character, then the Galois group Gx acts F,-linearly on the A-component 
Jc[p|, of the group Jo[p} of p-torsion points of the Jacobian variety Jc of 
C’. Now it turns out that if the ramification points t),...,tn42 € K of ¢ are 
in “sufficiently general position“, then the image of the Galois group Gx i 
GL,.(Jo[p],) is (essentially) generated by a Thompson tuple, and conversely, 
every Thompson tuple arises in this fashion. 


In order to make this connection between cyclic coverings and Thompson tu- 
ples more precise, it is useful to introduce the following alternate description 
of Thompson tuples, using the Braid group. 

For this, let s = n+2, and let Qi,...,Qs-1 be the standard generators of the 
Artin braid group B, on s strings. Mapping Q; to the transposition (2,2 + 1) 
yields a homomorphism « : B, — S;. The kernel of « is called the pure braid 
group, and is denoted by Bs). It is generated by the elements 


Qy-= Qj Oi G3 a Gp” Vgt< I Ss. 
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‘he braid group B, acts on tuples (g1,...,g;) of elements of any group by 
the tule that Q; maps (g1,---,gs) to 


(9iy+++)Gi-1) Jiti, GAGGIA y+++99s) 


ne (4) We = UGte-++3 Cn42) is an (n+2)-tuple of elements ¢, € F; 
satisfying the conditions 


n+2 


[[6 =1:  S.# 4) for alls, (1) 
j=l 


then there is a unique (up to conjugation) normalized Thompson tuple g = 
a = (01,+-+,%n41) in GLa(q) such that 
Xo(T) = Penyryrag aah Lei'sn, 
Xons1 (T) = (T — Gaal” (Tt — any} 
Moreover, every normalized Thompson tuple o in GL,(q) is of the form g = 
Ze for a unique (n + 2)-tuple ¢ satisfying (1). 


(6) In the situation of (a) there is a homomorphism 


o, : B’) — GL, (q) 


such that fori = 1,...,8s —1 the elements 7; := ®¢(Qis) are perspectiv- 
Mies with y,, = (T —1)"(T —C¢7!) and y---17,_1 = C5}: id. Thus 
(m,...,7s—2,CsTs-1) is a Thompson tuple associated with ¢. 


Proof. (a) Put s = n+ 2. Recall from Theorem 4.5 that a normalized 
Thompson tuple (o1,...,0n41) is characterized by the characteristic poly- 
Nomials of the elements a; and so by the s-tuples (b1,-+-,6n41,@) € (F,)"*? 
which satisfy the additional properties that 2 bal and thata sé 1, 
bagi Fl anda-b #1 forl <i<n. 

We associate to (04,.-.,On41) the s—tuple ((,,+++,Cn41,¢5) with ¢, = (ab;)7? 
forl <i<n, Ci = ae and ¢, = a. This tuple has the property that all 
Gi are different from 1 and that ie ate 

Conversely, if C1,+++5Cn42 € F* satisfying condition (1) are given, then a := 
Cnt2, Ona = Cngi and bj c= ¢7*¢7),, for 1 < i < n satisfy the above 
Conditions and hence give rise to a Thompson tuple a, = (01,...,@n41) with 
the indicated characteristic polynomials. . 

(b) For the definition of - see [V4] and [MSV]. (It is essentially the Gassner 
representation of B‘) associated with ¢). The first assertion follows from [V4], 


Lemma 3 and relation (5). The relation 7+--7,.1 = Cy'+ id follows from 
the definition of ®, and the fact that the element 


Qis =~ es = 05-1 a ‘Qo Qi Q2 irae O34 
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acts by sending (g1,...,9s) to (g1,...,9s)%*. The 7; generate an irreducible 
group by [V1], Lemma 3.3. 


We thus see that ¢-tuples give rise to Thompson tuples and conversely, On 
the other hand we can also attach to such tuples an (essentially unique) 
cyclic covering ¢ : C — P! which is ramified at s = n + 2 prescribed points 
t1,...,t, € WV as follows. 


Lemma 4.15 Suppose (,...,¢,; € Fy satisfy G+ ...-¢, = 1 and di := 
ord(¢;) # 1,1 <i < s. Let K be a field containing a primitive d-th roo} 
of unity where d = lcm(d),...,d,) and suppose that ty,...,t, € K ares 
distinct points. Then there is a cyclic covering 6 = dc: C > Ph anda 
faithful character \: Z := Aut(¢) > F* such that ¢ is unramified outside of 
{t1,...,t,} and is ramified at t; in such a way that the distinguished inertig 
group generator z; at t; satifies \(z;) = ¢;, for 1 <i< s. Furthermore, all 
automorphisms of ¢ are defined over K. 


Proof. This is elementary and well-known, but in order to be able to discuss 
the examples below it is useful to make this precise. Thus, fix a primitive d-th 
root ¢ € F, (which exists by hypothesis), Then there exist unique numbers 
M1,...,M, with 0 < m; < d such that ¢; = ¢™. Now consider the cyclic 
covering ¢: C — P* defined by the equation 


y* = (x —ty)™ (a — ta)" ---(x—t,)™. 


Since (™m,...,7™ms) = 1, this is a covering of degree d and since m,+...+m, = 
0 mod d, it follows that ¢ is unramified at infinity and hence also outside of 
{t;,...,ts}. Now let o be the unique generator of Z :=Aut(¢d) such that 
o(y) = e/4y. Then z; = o™ is the distinguished inertia group generator 
at t;, and so if we let \ : Z — F* be the unique homomorphism such that 


A(a) = ¢, then A(z;) = ¢, as desired. 


Now let Jc denote the Jacobian of the curve C of Lemma 4.15, and let p 
be a prime not dividing d. Then the group Z = Aut(@) acts on the group 
Jo[p] of (K-rational) p-torsion points of Jc, and so Jo{p] is the direct sum 
of its A-components Jo[p],* = Jo[p] @r,[z) V\e, where V\s ~ F,, denotes the 
representation space of the linear characters A* : Z 4 F;, of Z. Since the 
automorphisms z € Z are defined over K’, the Galois action commutes with 
the Z-action, and so Gx acts F,,-linearly on each space Jo[p],« (which has 
a natural F,,-vector space structure). Since Jc[p], has dimension n = s — 2 
over F, (see [V3], or equation(3) below), this yields a homomorphism Gx — 


GL, (q). 


Theorem 4.16 Let ¢ = ((,...,¢€,) be a ¢-tuple in F, satisfying condition 
(1) with the property that F, = F,(G,,...,¢;), and let O, = (01,..., 05-1) be 
the associated Thompson tuple in GLy(q), where n = s — 2. In addition, let 
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_.ests-1 be distinct points in k, and let t, = X be transcendental over k. 
Put K =k(X), and choose accordingly the cyclic covering 


o= 91,6 CG — Pk 


and character \: Z — FY as in the previous Lemma. Let S = Jcelp], be the 
associated A-component of the p-torsion points of Jc, and let N = Ny z be 
the fired field of the kernel of the homomorphism 


PX,Z! * Gr —s GL, (q)/Z' 


induced by the action of Gx on S ~ ee where Z' < Z(GLa(q)) is any 
subgroup containing A(Z). 


(a) The Galois group of the extension kN/k(X) is the subgroup 
GQ! = (oly.06s0%4) S GLn(9)/2’ 


generated by the images o! of the Thompson tuple in GL,(q)/Z'. Moreover, 
the ramification type of this extension is [G',{t1,...,ts-1},C’], where C’ = 
(Ci,...,Cl_1) and C} denotes the class of o{ in G’. 


(6) If G’ is self-normalizing in GL,(q)/Z’, then N is regular over k, and 
hence Gal(N/k(X)) ~ Gal(kN/k(X)) ~ G’. 


Proof. The analogous result for the case that the branch points f)...., ts 
are algebraically independent over k was proved in [V3], Theorem A. So we 
have to carry through a specialization argument which essentially boils down 
to replacing the pure braid group B*) by its normal subgroup generated by 
Me, «2+, Qen1.9: 

As in the proof of Theorem A of [V3], let O, be the space of subsets of C of 
cardinality s, and H(C) the Hurwitz space related to covers of type ¢; and 
let p be a point of H(C) over the point p = {ti,...,t,} of O, such that the 
corresponding cover of P' has ramification of type ¢; over t;. We can proceed 
as in that proof till to step 4. 


There we specialize: O has to be replaced by the curve L defined as fol- 
lows: Let £ be the curve on ©, consisting of all {t1,...,tnai,t}, t € C\ 
Baht e5tn41), and £ the (absolutely) irreducible component of the inverse 
Image of £ in H,(G)'™) that contains p;; here Ay = V- Z’. The funda- 
Mental group 7(Q,,p) can be identified with the braid group B, in such a 
Way that Qis,---,;Qs-1,s correspond to loops around ¢;,...,£;-1 on £ that 
Generate 7,(£,p). Let H be the stabilizer of £ in Ao/A, = GLa(q)/Z’. 
Let &. : BY) — GL,(q)/Z' be the composition of ®¢ (see Lemma 4.14) 
With the canonical map GL,(q) ~ GL,(q)/Z’. As in [V5], Lemma 2.2 
We see that H is conjugate in GL,,(q)/Z’ to the group G" generated by 
%.(Q1.,), ..+;®¢(Qs-1,5). We may assume H = G'. 
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Then similarly as in Step 4, the Galois action of Gkx) on the subspace 
S = FY of the p-division points of the Jacobian of C induces modulo Z’ the 
group H = G’. Furthermore, if we let N’ = kN (which is the fixed field of 
the kernel of the map Gjx) — G’), then the extension N’/k(X) is ramified 
where ¢ equals some ¢;, and /(Q;,s) is a corresponding distinguished inertia 
group generator. This (together with Lemma 4.14) proves part (a) of the 
theorem. 


(b) Note that Gx) is normal in Gx). Thus, if G’ is self-normalizing jp 
GL,,(q)/Z’, then the image of Gix) In GL,(q)/Z' also equals G’, and so the 


corresponding extension N/k(X) is regular over k. 


Remark 4.17 Let A(S)) denote the extension of K obtained by adjoining 
the coordinates of the p-torsion points in S$, C Jo[p] to K. Then clearly 
K(S)) = Ny, is the fixed field of the kernel of the representation 


pr: Gr — Aut(S)) ~ GL, (q) 


on S,, and hence is a Galois extension of K which contains the field(s) 
N = Ny z of Theorem 4.16. Moreover, K(S\) is cyclic over N with Galois 
group Gal{A(5,)/N) < Z’. We shall see below in Theorem 5.14 that the 
extension [{(S))/N) z is always an unramified extension. 


The consequence of Theorem 4.16 is that the almost unramified families of 
covers of P’ constructed in the previous section are obtained by adjoining 
Galois invariant subspaces of torsion points of families of abelian varieties to 
K(X). This imposes strong conditions on these abelian varieties. 

In the next section we shall discuss some consequences of these conditions in 
more detail. 


o <A family of curves with infinite geometric 
fundamental group 


9.1 The abelian variety J?°” 


In order to interpret the results of the previous section geometrically, it is 
useful to introduce the following “new part” J2° of the Jacobian Jo of a 
cyclic covering  : C — C’ of curves, which is (partially) analogous to the 
new part of the Jacobian of the modular curve Xo(NV). 


Definition 5.1 Let 7: C — C’ be a cyclic covering of curves defined over 
an arbitrary field K, and let Z = Aut(7) ~ Z/NZ denote its covering group. 
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reach subgroup H < Z let my : C + Cy = C/H denote the quotient 
ap: Then we call the abelian subvariety 


Id Se . 
Je = THIC/H 
H<Z 
H#l 


the old part of Jc, and its orthogonal complement (with respect to the canon- 
ical polarization on Jc) the new part JZ. 


For our purposes it is important to observe that J@®” and J? can be ex- 
pressed in terms the following idempotent €,.. of the group ring Q{Z]: 


Even d= wl(dyea, 


d|N 


where, for a subgroup H < Z of order d, 


eo, = en == he QZ] 


heH 


Lemma 5.2 If y: Z — C* is a fized character of order N = |Z|, then 


N-1 N-1 
a. | 
ca = Does and enew= D> ext (2) 
k=0 k=1 
d|k (k,N)=1 


where €,% = + do oez x(g)*g71 € C[Z] denotes the primitive idempotent as- 
sociated to the character y*. In particular, Ene is a symmetric idempotent 
of Q(Z] and 

Ed‘Enew = 0, for all d|N, d£#1. 


Proof. Since the y* form a dual basis of the group ring C[Z], it is enough to 
verify (2) after evaluating both sides by y*, for k = 0,..., N—1. Now for any 
class function w on Z we have u(ey) = (li, Wi), O(e,*) = (*,%). Since for 
@ subgroup H of order d we have XH =ly <> Ker(y*)>H <> dik, it 
follows that y*(eq) = 1 if d|k and x"(€q) = 0 otherwise. From this, the first 
formula of (2) is obvious and the second follows readily, using the fact that 
Six aye H(A) = 1, if (k,N) = 1 and s = 0 otherwise. 

hus, by (2) we see that €,., is a sum of pairwise orthogonal idempotents, 
and hence is also an idempotent. Furthermore, €,.,, is symmetric since all €4 
are symmetric. 
Finally, if d £ 1, then (2) shows that ¢, and €,.,, have no common compo- 
Rents ¢,., and hence are orthogonal. 
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Corollary 5.3 Put é.@., = Ne and f.7g = N—€. Then them Sola € End(J,) 
and we have 


JEP Sesuldel ge Sere = Stew Ie). 


Furthermore, Jo = J#¢ + Ja’, and J@7N IG” < Jo[N]; in particulay 
Jo acs Ia x ae hae ; 


Proof. By definition, Nénew € Z[Z] C End(Jc), and so Exew, Soa € End( Je). 
note that: 2,79 ="N (1 —£,.24) = eigdlN u(d)Neg. Put 4 = deg € ZZ). 
Then é4(Jc) = m},(Jo/H,), so it follows from the definition that €oia(Jc) ¢ 
Digan TH (Jo/sHa) = Je?, On the other hand, by Lemma 5.2 we have 
Eouéd = Néa, if d # 1, so wy (Joyn,) = Ea(Jc) = Eota(Ea(Jo)) C Eota(Jo) 
and hence en = Egiaq(Jc). By construction, Enew + Eola = Niytnakscea = 0; 
so JZ’ := Enew(Jc) is the orthogonal complement of Jel, and hence the 
isogeny relation follows. 


Corollary 5.4 Suppose p # char(J‘) is a prime with p= 1 modN. Then 
there exist primitive characters \: Z — F* and oe Z> of order N and 
we have natural identifications (of Z- aed ies and Gr- niodules) 


Jel = ED Ielpls, and 7,08") = QD Tc);e, 
1<k<N 1<k<N 
(k,N)=1 (k,N)=1 


where T,(A) = limA[p"] denotes the Tate-module of an abelian variety A. 


In particular, dim Je" = $0(N)(2gc1 —2 +1), where r = #{PEC'(h): 
ep(7) > 1} denotes a te of ramified primes. 


Proof. By the hypothesis on p, both F, and Z, contain a primitive N-th 
root of unity and so A and \ exist. Thus, if we replace y by X and d in 
(2), the analogous formulae of (2) hold in F,[Z] and in Z,[Z]. From this the 
above decomposition follows because JZ new {p) = = Grew (Jelp)) and Je[pl\. = 
€\x(Jco[p]), and similarly, T,(J2™) = Enewlp(Jo) and T,(Jo)x. = ex Tp(Jc). 
(Note that this is also a decomposition of Gx-modules since the Z-action 
commutes with the G,--action. ) 

To work out the dimension of J2°”, we use the well-known fact (cf. Serre[Se2), 
p. 106) that the character h, of the representation of Z on T,(Jc) © Q, 1s 
given by 

hy =2-l¢y+ (2gc — 2)regz + az 


where az = Dopieo apr, and ap: = (regz — 1p,) denotes the (tame) Artin 
character at P’. (Here Dp is the decomposition group of any P € 1~*(P’), 

and * denotes induction.) From this it follows easily (by Frobenius reci- 
procity) that if k # 0 mod N, then the rank of T,(Jc) x. (= dime, (Je[p],«)) 


18 


(Ay, A*) = (2ac, — 2) + ry, wherer;, := (az, A\*) = #{P’ EC": ep Yk}. (3) 
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ince rank(T,(Je°")) = 2dim(J2°"), the asserted dimension formula follows. 


tiene the abelian variety JZ°”’, we can now give the following geometric 
Using ¥Y Yo gg 
jnterpretation of the main result (Theorem 4.16) of the previous section. 


Theorem 5.5 Let K = k(t), where t ts transcendental over k and k contains 
the N-th roots of unity. Suppose that ty,...,tn4, aren+1 distinct elements 
in k, and put tnyo = 0. In addition, suppose that m,,...,Mn42 are integers 
with 1 < mi < N such that gcd(m,...,Mn42,N) =1, m+... + maz. = 0 
mod N and m; # N — mayo, for1 ¢ign+1. Letx:C — FP" denote the 
cyclic covering of degree N defined by the equation 


y™ = e(a —t))™ +++ (2 — tage) ™*, (4) 
where c € k*, and let A= JZ” denote the new part of the Jacobian Jo of 
C. Then dim A = +0(N)n, and for any prime p=1modWN we have: 


(a) The group A{p] of p-torsion points of A is the direct sum of Gx-invariant 
subspaces S; = Jc[p]\., where (i, N) = 1, and X: Z = Aut(r) > Fy is a 
primitive character. Moreover, Gx acts irreducibly on each S;, and hence 
semi-simply on Alp]. 

(b) Let L; (respectively, M; and M;) denote the fixed field of the kernel 
of the action of Gx on S; (respectively, on S;/\(Z) and on P(S;)), so 
L; > M; > M;. Then L;/M; is a cyclic extension of order dividing N, 
and M;/Ix (respectively, M;/K) is a Galois extension whose geometric part 
G := Gal(kM,/kK) < GL,(p)/\(Z) (respectively, G := Gal(kM,/kK) < 
PGL,(p)) is generated by the image of the Thompson tuple attached to the 
covering. In addition, M; (and hence also M;) is ramified only at ty,...,tn41 
with ramification order dividing N. 


(c) The field K(Alp]) (which is generated over K by the coordinates of all 
p-torsion points of A) ts an abelian extension of the compositum M of all the 
M;’s of degree dividing N*\%). Moreover, M/K is ramified only att,,...,tn41 
with ramification order dividing N. 


(d) If (n,p—1) =1, (and N{6 ifn =3), then M; is regular over k and 
Gal(M;/K) = PGL,(p) = PSL, (p). 


Proof. (a) The first assertion follows directly from the decomposition of 
Corollary 5.4. Next, choose a “canonical generator” o of Z as in the proof 
of Lemma 4.15. Then for any 2 with (1,N) = 1, the hypotheses on the 
m,;’s guarantee that the (n + 2)-tuple ( = A‘(o™),...,€n42 = A (O™"*?) 
Satisfies condition (1) of Lemma 4.14 and so by Theorem 4.16(a) (with Z’ = 
4(GL,(p))) it follows that the image of the Galois group Ggx on P(5S;) is 
Senerated by a Thompson tuple and hence Gx > Gx acts irreducibly on Sj. 
(b) Since Gal(L;/M;) = Gal(L,/K)NA(Z) < \*(Z) (viewed as subgroups 
of Autr, (5;) ~ GL, (p)), the first assertion is clear. The other assertions 
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follow directly from Theorem 4.16(a) by taking Z’ = \*(Z) (respectively, 
Z' = Z(GLa(p))). 

(c) Since Alp] is the direct sum of the S;’s (cf. part (a)), we have K(A[p]) 
I] £:, and so both assertions follow from part (b). 

(d) The hypothesis implies that m is odd and that PSL,(p) = PGL, (p)., 
Thus, from Theorem 4.6 it follows that the image G of the Thompson tuple 
in PGL,,(p) generates the whole group. In particular, G is self-normalizing 
and so M; is regular over k by Theorem 4.16(b). 


—_— 
= 


Remark 5.6 We shall see below in Theorem 5.14 that the extension L;/M, 
ts always unramified; in particular, K(A[p]) 1s unramified over M. 


By applying a variant of the Serre-Tate criterion of potentially good reduction 
(see below), we obtain 


Theorem 5.7 If C/K is as in Theorem 5.5, then the new part A = JE™ of 
its Jacobian Jo has potentially good reduction over K. In other words, there 
is a finite extension Ky of K such that Ax, = A® Ko extends to an abelian 
scheme A over the projective curve Ty with function field Ko, 1.e. for each 
geometric point P € Ty(K’) the fibre of A over P is an abelian variety. 


Proof. By Theorem 5.5 (c), we see that for all primes p = 1 mod N, the 
ramification degrees of the extension K(A[p])/K divide N*%)*1, and so the 
assertion follows from the following criterion: 


Proposition 5.8 (Serre-Tate Criterion) Let K be a field with a discrete 
valuation v, and let A be an abelian variety over K. Then the following 
conditions are equivalent: 


(i) A has potentially good reduction at v; 


(tt) the ramification degree of v in K(A[m]) is bounded for all m prime to 
char(K(v)); 


(iti) there is a constant c such that the ramification degree e,(K(A[p])/K) < 
c, for infinitly many primes p. 


Proof. This is well-known, but for convenience of the reader we present the 
proof. 

(i) > (ii): This follows from the criterion of Néron-Ogg-Shafarevich for good 
reduction; cf. Serre-Tate[ST], Theorem 2. 

(ii) => (iii): Trivial. 

(iii) > (i): By replacing K by a suitable finite extension K’, we may assume 
without loss of generality that A has semi-stable reduction at v. (For ex- 
ample, we could take K’ = K(Al[p]), for any prime p > 3, p # char(«(v)); 
cf. Grothendieck{Gro], Prop. 4.7). In addition, we may assume that A’ is 
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enselian with separably closed residue field. Then the implication (iii) > 
(i) fo 
Claim: If A does not have good reduction at v, then ple,(K(Al[p])/K), 


for every prime p {#(®(Ax)) (p # char(«(v))), where ®(A;,) denotes the 
(finite) group of components of the Néron model of A/K with respect to v. 


To see this, we first note that for p as above, #(A[p]()) = #(A°[p](«(v)) = 

t+2a where t denotes the toric rank and a the abelian rank of the connected 
component A® of the identity of the reduction of the Néron model. (Note 
that t+ a@ = d := dimA, and that ¢ > 0 since A has bad reduction.) On 
the other hand, for K’ = K(A[p]) we clearly have #(Al[p](K")) = p’t, and so 
p|#O(Ax’): Thus, p|#(®(Ax:)/®(Ax)). On the other hand, the quotient 
group ®(Ax:)[p|/®(Ax)[p] has exponent e,(4’/K’); this follows from [Gro], 
Th. 11.5, together with formula (10.3.5). Thus ple,(A’/K’), as asserted. 


llows once we have verified: 


Remark. In the above implication (iii) => (1) we needed several deep results 
from Grothendieck’s article [Gro]. However, all these can be avoided if in 
condition (iii) we can assume in addition that K(A[p])/K is tamely rami- 
fied for the primes p in question. (This is always the case if |GL24(p)| is 
prime to the residue characteristic char(/. ), but for the above application we 
need to assume only that (char(K),.V) = 1 since we already know that the 
ramification degree divides a power of N.) 

In that case the implication (iii) => (i) can be proved as follows: 

Again, it is enough to prove this in the case that K is henselian with sepa- 
tably closed residue field. If £ denotes the compositum of the K(Al[p])’s for 
the primes in question, then the hypothesis (and tame ramification) implies 
that L is a finite (cyclic) extension of K. By construction, L(Al[p]) = L 
for infinitely many p’s, so by the Néron-Ogg-Shafarevich criterion for good 
reduction (cf. [ST], Theorem 1), A has good reduction over L and hence 
potentially good reduction over K. 


Let us now come back to Theorem 5.7. By imposing further restrictions on 
the m,’s, we can conclude that all of Jc has potentially good reduction: 


Corollary 5.9 Suppose that C/K is as above and satifies in addition the 
condition that m; # —Mny2 modd, for every d|N with en4o [d #1, where 
fn42 = Ga: Then Jo has potentially good reduction over K. 


Proof. Induct on the number of the divisors of N. If N is prime, then Jc = 
Je” and so the assertion follows from the theorem. If N is composite, then 
by the theorem it is enough to show that J2!¢ has potentially good reduction 
9r, equivalently, that every Jy := Jcjy, has potentially good reduction (for 
d|N, d #1). This is clear if e := e€n4\d, for then Cy := Cy, is unramified at 
tn42 and hence the covering is already defined over K. Thus, assume € fd. 
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Then the ramification of Cy — P' satisfies the same hypotheses as C > PI 
and so by the induction hypothesis we have that Jc, has good reduction, 
t 5uMzuyN 


5.2 Rational points on A(P(S;)) 


One of the advantages of the above geometric description of the extensions 
generated by Thompson tuples is that it is possible to determine whether o, 
not the extension field has a rational point. This is based on the following 
(well-known) fact. 


Proposition 5.10 Let A be an abelian variety over K which has potentially 
good reduction with respect to a discrete valuation v on K. Suppose that v is 
unramified in K’ = K(A[m]), where m > 3 is an integer which is relatively 
prime to the characteristic of the residue field k = «(v) of v. Then: 


(1) the reduction A, of A at v is an abelian variety over k; 


(11) of v’ denotes any ezrtension of v to K', then its residue field is K(v') = 


k(A,[m)). 


Proof. The first assertion follows from [ST], Theorem 2, Corollary 2(b). To 
prove the second, let k’ = «(v’). Then the reduction map ry : A(K') — 
A,,(k’) induces an isomorphism A[m](K') ~ A,[m](k’), so all m-torsion 
points of A, are rational over k’. Thus k(A,[m]) Cc k’. To prove that equality 
holds, let o € Gal(k’/k(A,[m])). Then o lifts uniquely to an automorphism 
GE roe) where D,(K’/K) < Gal( K'/K’) denotes the decomposition 
group of v’. Since the action of 6 on Alm] is (via r,-) the same as that of 
a on A,[m] = A,{m], this means that @ acts trivially on Alm]. But then ¢ 
and hence a are both trivial, so k’ = k(A,[m]). 


Corollary 5.11 In the above situation, suppose that S C Alm] ts a Gx- 
invariant subspace. Let S denote the image of S in Ay[m]. Then x(t 'K(s)) = = 
k(S) and K(UiK (P(S 
splits completely in KP (S)) if and only if every k-isogeny of A, with kernel 
in S is k-rational. 


P( y= = k(P(S)). In particular, ifm = p is a prime, then v 


Proof. By the same argument as in the proof of the proposition, we see 
that Gal(k’/k(S)) ~ Dy (K'/K) 9 Gal(K’/K(S)) = D,(K'/K(S)), and so 
the first assertion follows. To prove the second, we note that the group 
Gal(A(S)/A(P(S))) consists of those o € Gal (K(S )/K) which act diag- 
ae on S. Since Gal(k($)/k(P(S))) is defined similarly, we see that 
Gal(k(S)/k(P(S))) ~ D ves (S)/K (P(.S))), and so the second assertion 
es 

To prove the last assertion, we observe that by the above result v splits 
completely in K(P(S)) => k(P(S)) = ks everya € Gal(k(S)/k) acts 
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jagonally on S$ <> o(F,3) = F,8, for all s € S and o € Gal(k(S)/k) 
every isogeny y: A, — A’ wah kernel Ner(.v) = F,8 (where s € S—{0}) 
a f-rational <=> every k-isogeny of A, with kernel in S is k-rational. 


The main difficulty in applying the above criterion to our situation is the 
fact that we need to guarantee that v is unramified in A‘( A[p|) for some p. 
Although we already know by Theorem 5.5 the ramification behaviour of the 
extension EN ;))/#y, there remains the problem of understanding that of 

K(S 3) /K(P(S;)). This will be analyzed next by using the following criterion. 


Proposition 5.12 Let A/K be an abelian variety and let v be a discrete 

valuation of IX. Moreover, let I = I, denote the inertia group of an extension 

vy! of v to K’ = K(Alp]), where p # char(x(v)) ts a prime. If S C Alp] is 

a non-zero Gx-invariant subspace and H = Gal(f'/W(P(S))), then ViK(S) 

4 unramified over K(P(S)) if and only if S'™ 4 {0}. In particular, if 
Si # {0}, then vi,.,5) ts unramified over h(P(S)). 


Proof. Let Ho = Gal(A’/A(S)) < H. Then vi,;s) is unramified over 
B(P(S)) — > INA < Hy — INAH acts trivially on S > SIM = S. 
Thus, if vjz(s) is unramified, then clearly S’"” = S # {0}. Conversely, if 
So:= SIM # {0} and g € INH, then g acts diagonally on S and trivially 
on 5o, so g acts trivially on all of S. Thus 1Q H < Mp, and so vjqy5) is 
unramified. 


Corollary 5.13 Suppose in addition that S = Alp], where \: Z — Fr i 

a character on a finite subgroup Z < Aut(A). If either \[p]f # {0} or if 
T,(A i # {0}, where 1: Z — Z” is a lift of \ and I < Gy denotes the 

a olute inertia group of an extention of v (and of v') to K°°, then Vin(sy % 

unramified over N(P(S)). 

In particular, if the reduction A, of the Néron model of A/KX at v is not 

unipotent (i.e. the connected component of A, is not an extension of additive 

groups), then v’ is unramified over K(P(Al[p])) 


Proof. If the second Laas ‘ee then also V := T,( i QF, # {0}. 
But then {0} # V C (T,(A); @ = Alp}, which means that the second 
hypothesis implies - first. - the first hypothesis holds, then S?7"% D 
Ss! # {0}, and so the conclusion follows directly from the proposition. 

In particular, taking for 4 the trivial character (on Z = {1}), then by [Gro]. 
Prop. 2.2.5 (and (2.1.11)) we have rank(7’ (A)! = n—A(A), where n = 
dim(A A) and \(A) denotes the unipotent rank of A,. Thus, 7,,(A)! = {0} <= 

v 1S unipotent, and so the assertion follows from the previous criterion. 


The above criterion can be used in our situation in the following way. 
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Theorem 5.14 In the situation and notation of Theorem 5.5, let v; denote 
the place of K/k corresponding to the specialization t = t;, for1 <i< N44 
Then: . 


(a) If v ts any place of K/k with v ¢ R:= {u,...,Ungi}, then C and hence 
Jo and A= JZ have good reduction at v. 


(b) For eacht with 1 <i <n-+1, let C; denote the normalization of the 
curve 
yN = ce(z os, Th ie eon (x = ee ae . (x 1 t; )™n+2 


and let 7, :C, + Pl denote the associated cyclic covering of degree N. Then 
C; is the normalization of the reduction of C at v;, and the new part JE” of 
its Jacobian has dimension +o(N)(n — 1). 


(c) Let Jy, and A,, denote the reductions of the Néron models of Jc and A = 
JE at v;, and let JO and A® denote their respective connected components 


of the identity. Then there are natural surjections of algebraic groups f, : 
J? — Je, and fre’: Ao = das 


(d) Let l= 1, < Gx denote the inertia group of an extension of v; to K*?, 
Then for any prime p= 1 modWN (with p F char(K’)) and any primitive 
character \: Z — Z%, we have that rank(T,(Jo)§) >n-—1. 


(e) For p as above, let X : Z = F* be a primitive character. Then for 
any 9 with (j,N) = 1, the extension L; = K(S;) ts everywhere unramified 
over M; = K(P(S;)), and hence K’ = K(Alp]) is ramified over K only at 
the places of R with ramification index dividing N and is unramified over 


M =[]M,. 


Proof. Recall that C is by definition the normalization of the projective plane 
curve C’ C Py defined by equation (4). Let C’ denote the closure of C in 
Pp: = P’ x P’, and let v : C + C’ denote its normalization (in «(C’) = «(C)). 
Then, if v is a place of K/k with v # vx, the place at infinity, then the fibre 
C', of C’ is the projective plane curve over «(v) defined by the equation 


y = c(x = a ie we ‘(x = bee : (x = prt, 


where t € k(v) D AW denotes the image of t in the residue field of v. Since 
this curve is reduced and geometrically irreducible, it follows that the same 
is true for the fibres C,, of C, and so the normalization yy, : Cael Ce of 'G;, 
factors over the finite map v, : C, + C!. Note that by considering a different 
affine model, the above argument extends to show that the last assertions 
are also true for v = vy. 

(a) If vu # vu... then the hypothesis on v means that ¢ # t),t2,...t,4,. Thus, 
by Riemann-Hurwitz, the genus of C,, is the same as that of C, and so C has 
good reduction at v. Then Je also has good reduction (cf. [BLR], 9.4/4) and 
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ce so does any quotient A of Jo (cf. [BLR], 7.5/3). Since the argument 
y = Voo is similar, this proves assertion (a). 

Now suppose that v = v;, sot = t; (and K(v) = k). Then C) is the 
projective plane curve defined by the given equation and so by what was 
“aid above, C; = C,, which proves the first assertion. The second assertion 

Beiows directly from Corollary 5.4, for 7; : C; —+ Pj is a cyclic covering of 
degree N which is ramified at precisely r = n + 1 places. 

c) Let &§:C —C denote a desingularization of C (which exists since the base 
P}, is clearly excellent). Then, since C is normal, C; is the normalization of a 
component of the fibre C, of C at v = v;. Thus, we have a natural surjection 
Pic? — Picg.,, (cf. [BLR], 9.2/13). On the other hand, by [BLR], Th. 


5/4, Pico ih is canonically isomorphic to the connected component J° of 


the identity of the reduction J; = J,, of the Neron model of Jo at v = w, 
and so we have a canonical surjection f; : J? — Je,. 

Next we observe that the covering automorphisms o € Aut(7) map isomor- 
phically onto those of 7; : C; — Pj, and so these extend to automorphisms 
of J? and of Jo, in such a way that f; becomes equivariant. From this it 
follows that if €; : Jo, — J&°” denotes the projection map onto the new part 
of Je, (cf. Corollary 5.3), then the composition ¢; 0 f; factors over the map 
Enew,i : Jy — A’, which is induced by the universal property of Néron models; 
i.e. there is a homomorphism f?*” : A), + Jé*“ such that e;of; = fP*“0Enew,i- 
Thus, since f; and <, are surjective, so is f?°“, which proves the assertion. 


(d) By Grothendieck [Gro], Prop. 2.2.5 (and (2.2.3.3)), we have a natural 
identification T,,( A?) ~ T,(A)! (after passing to the henselization of v;). Since 
the action of Z commutes with the Galois action, this induces an isomorphism 
T,(Ap); ~ T,(A){. Now by (c), the map f/"*“ is surjective, hence so is the 
induced map on the Tate modules, and therefore 7;,(A?); — T,(Jc,)x is 
surjective as well. Thus, dim7,(A); > dimT,(Jé,); = n — 1, the latter by 
formula (3) of Corollary 5.4. 

(e) Let v be a place of K/k. If v # v;, then A has good reduction at 
v, so K(A[p|) is unramified over A’ at v, hence a fortiori so is A(S;) over 
A(P(S;)). On the other hand, if v = v;, then in in view of (d) we can apply 
the criterion of Corollary 5.13 to see that UK(S,) is unramified over A (P(S;)) 
for every extension v’ of v. Thus, L; = K(S,) is everywhere unramified over 
M; = K(P(S;)) and hence K(Alp]) = [] L; is unramified over M = [] M; as 
Well. On the other hand, it follows from Theorem 5.5 that each M; and hence 
M is unramified outside R and ramified at each v; € R of order dividing N, 
So the same is true for K’. 


By using the above proposition, we can now make the assertion of Theorem 
9.7 much more precise. 
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Theorem 5.15 Let Ivo be a finite Galois extension of K = k(t) such that the 
ramification index €; = €y,(INXo/K) at v; is divisible by N, forl <i<n4 i 
Then A = J?” has good reduction everywhere over No, and hence Ko( Alm) 
is unramified over No for every integer m relatively prime to char(K). 
Proof. If p = 1 mod N is any prime (p ¥ char(W’)), then by Abhyankar’, 
Lemma it follows from Theorem 5.14(e) that Ko(Al[p]) = K(Alp]) - Wy js 
unramified over Ao. By the criterion of Néron-Ogg-Shafarevich, this means 
that Ax, = A @ No has good reduction everywhere, and hence Ko( A[m]) is 
unramified over [9 for all m which are prime to char(K‘). 


Thus, by above theorem we can now apply Corollary 5.11 to obtain following 
criterion for the existence of rational points: 


Corollary 5.16 Suppose in addition that v = v;, 1s totally ramified in Ko, 
and let A; denote the reduction of A@ Ko at v, where v denotes the (unique) 
extension of v to Kg. If p # char() is any prime and S C Alp] is « 
Gy-invariant subspace such that 


(xx) every isogeny of the reduction Ag with kernel contained in S (= the 
image of S in A,) is rational over K, 


then every extension of 0 to Ms := Ko(P(S)) ts a K-rational point of Ms. 


5.3 Example 


We now return to Example 4.11, but change the notation slightly to conform 
with that of this section. Thus, let k be a field containing a primitive fourth 
root of unity 2 (so in particular char(k) # 2). As before, take n = 3 and (in 
the notation of Theorem 5.5) N = 4,m, =... = m3 = 1,m4 = 3 and ms =2. 
Moreover, let the ramification points be t; = 0,t2 = 1,t3 = —l,ty = a,ts =t 
where a € & \ {0,41} and ¢ is transcendental over k, and put A = &(t). 
Then a corresponding cyclic cover 7: C — Pj. is given by the equation 


Y* = cX(X —1)(X +1)(X — a)*(X — t) with ce k*. (5) 


Clearly, C has genus 4 and deg(z) = 4. Moreover, 7 has a subcover of degree 
2 whose quotient curve is an elliptic curve F given by the equation 


Z? = cX(X —1)(X +1)(X —a) withZ = Y?/((X —a)(X—2)). (6) 


Note that F’ is a constant curve, i.e. E is already defined over k. Moreover, 
f maps injectively into the Jacobian Je of C and can be identified with the 
old part J2/4 of Je (cf. section 5.1). Its complement in Je is therefore the 
new part A= J@°"; thus Jc ~ F x A and so dimA = 3. 
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we choose a cyclic extension Ko of K of degree 4 which is totally 
ified at each of the points of R = {0,1,—1,a}. For example, we can take 
z -(t,s) where 
Ko = k(t, Ss) Ww 

si = t(t? — 1)(t — a)g(t), (7) 
Bad g(t) # 0 is any nonzero polynomial of which does not vanish at R. 
Thus, if To denotes the smooth curve over k defined by this equation (whose 
function field is Ko), then by Theorem 5.15 we obtain 


Proposition 5.17 The abelian variety Ax, = A® Ko extends to an abelian 
scheme A over the curve To. 


Now we want to verify that the tower of unramified covers Ko(P(S,)) of Ko 
attached to the subspaces S, C Alp] for p = 1 mod 4 has rational points 
which all lie over the same base-point v of Ko. 

For this we shall use the criterion («*) of Corollary 5.16 applied to the place 
y = v4 corresponding to the specialization of t to ty; = a. However, in order to 
do this, we need to determine the structure of the reduction Ag := Aj of Ax, 
at the unique extension v of v to Ko. (Recall that by the above proposition 
we know that Ag is an abelian variety over k.) As a first step towards this 
end, we shall prove: 


Proposition 5.18 Let J; denote the reduction of the Jacobian J = Jc ®@ Ko 
atv. Then 

Ji = Jo Xx Ea, 
where Jo is the Jacobian of the curve Co which is the normalization of the 
curve defined by the equation 


¥* = o&( -1(X+1)(% 0) 
and Fy denotes the elliptic curve given by the equation 
Y? = X(X?* - 4d), (8) 
where d = c/g(a) and g is as in (7). 
Proof. First note that since Jc ~ Ex A, and F is a constant curve, it follows 
from Proposition 5.17 that J has good reduction over Ty and in particular at 
v. Thus, if C denotes the minimal model of C at v, then C has semi-stable 
Teduction (use [BLR], 9.5/4, 9.2/5 and 9.2/12). Thus, if C,,...,C, denote 


the irreducible components of the reduction C; of C, then each C; is smooth 


(by [BLR], 9.2/12) and we have (by [BLR], 9.5/4 and 9.2/8) 
Ja= Je, X...X Jes 


Thus, the assertion follows once we have shown that Co and Ea occur as 
Components of C;, for then all other components must have genus 0 (since 
9(C) = 4, 9g(Co) = 3 and g(E,) = 1). For this it is enough to show: 
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Claim: There are normal models C’ and C” of C over D; whose reductions 
Ci, and C% each have an irreducible component Ci, and CZ, with function 
field k(C3,) = K(Co) and «(CY ,) ~ «(£a). 

Indeed, if such a model C’ exists, then its desingularization and hence the 
minimal model C have the same property (because g(Co) > 0). But since all 
components of C are smooth, there is a component C, of C with C, ~ on 
and similarly, if such a model C” exists, then C has a component C, ~ E.,. 


Proof of claim: a) The construction of C’: Let C denote the normal mode} 
constructed in the proof of Theorem 5.14. Its fibre C, at v is integral anq 
has Co as its normalization (because Y4 = cX(X* — 1)(X — a)° also hag 
normalization Co). Thus, if we let C’ denote (the normalization of) CQ Oe 
then its fibre Ct has normalization Co, as desired. 

b) The construction of C”: Let Fo = «(C @ Ko) denote the function field of 
C@ Ko; thus Fo = Ko(X,Y) = k(t,s, X,Y), where X and Y are related by 
equation (5) and ¢ and s by equation (7). Put c = (X —a)/s* and y = Y/s°, 
Then Fo = Ko{x,y) and equation (5) becomes 


ie al Ga - B)*g:(2), 


where B = (t — a)/s* = (t(t? — 1)g(t))"’ € K = k(t) and g(x) = cX(X* - 
1) = c(s4z + a)((s4z + a)? — 1). 

Let & denote the integral closure of B = O;[x] in Fo, and let C” = Spec(Q). 
Fix an irreducible component of the reduction of C”, and let V denote the 
associated (normalized) valution of Fo. We then have: 


V(S¢a,2') = emin(d(a,)), if a; € Ko, 


for some integer e > 1. Since by construction 0(s) = 1, o(t — a) = 4 and 
v(g(t)) = 0, we see that V(x — B) = V(g,(z)) = 0, and so also V(y) = 0. 
Thus, if z and y denote the images of x and y in the residue field «(V) = 
O;/IM; of V, then the above relation specializes to 


y” = aot”(z — bo)’, 

where ag = g;(x) = ca(a? — 1) and by = B = (a(a? — 1)g(a))~!. Since this 
equation is irreducible over k(Z) = K(Vixo(x)), we see that [k(z,y) : k(z)] = 4. 
Thus, since [K(V) : K(Vixq(zy)] < [Fo : Ko(x)] = 4, it follows that n(V) = 
k(z,y). (It also follows that e = 1 and that V is the unique valuation above 
Vixo(z), 1.e. that the fibre of C” at 0 is integral, but we don’t need this.) 

It remains to show that k(z,y) = «(£,). For this, put u = y?/(z(z — bo)) 
and v = agy/u. Then by the above relation we have u? = az, so k(z,y) = 
k(u,v). Moreover, v? = agy?/Z = u(u? — aob), so K(V) = «(Ey), with 
d = agby = c/g(a), as desired. 
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emark 5.19 It follows from the above proof that reduction Cy of the min- 
+a] model C has the form Cy = C, U...UC,, where Cy = Co, C. & Ea, 
ci. cx P’, for 2 <i <r-—1, and each C; meets C;_; transversally in a 
que point for 2<7<r. 


unl 


Next we want to determine the abelian variety Jp up to k-isogeny. To simplify 
matters, we shall assume in the following that 1 — a* is a square in k or, 
equivalently, that @ = oo for some 6 € k, where 6 # 0,41. (In fact, 


b=(1+ (1—4?))/a.) In addition, we shall choose c = (1 + 8). 


Lemma 5.20 Let Co be the curve defined over k by the equation 


26 
Y¥4 = cX(X —1)(X+1)(X —a), wherea= ink andc=1+68?, 
for some b€ k\{0,+1}. Then its Jacobian Jo is up to an isogeny of 2— power 
degree isomorphic to E x E; x Ey, where E is given by equation (6) and Ey 
and Ey2 by equation (8) by putting d=1 and d = b’, respectively. 
In particular, if b = bj is a square ink, then Jo is k-isogenous to E x £y x Fy. 


Proof. Let F = x(Co) = k(X,Y) denote the function field of Co. Then 
F' := k(X,Y?) = «(B) is the function field of E. We shall show that there 
are two subfields F, ~ x(E,) and Fy ~ «(Ey2) of F which, for suitable 
U,V € F, fit into the following field diagram of quadratic extensions: 


PF 
Pn ue, 
Fy Fy F’ 
a ee ee \ 
K(V) k(X) 
* ~ 
k(U) 


From this the assertion follows because the fact that F/k(V) is a Z/2Z x 
Z/2Z-extension implies that Jo = Jp ~ Jp x Jp, x Jp, = Ex FE, x Eye where 
the indicated isogeny has 2-power degree; cf. [KR]. 

To find these subfields, put U = cX(X — a)/(X? — 1), so [k(X) : k(U)] = 2. 
Then 
(U — 1)(X? — 1) = cX(X — a) —(X?-1) = (1 - bX)’ 


because c— 1 = 6? and ca = 2b by hypothesis. Thus, if we put 21 = 
Y(1 — bX)/(X? — 1), then 


ZA. = U(U — 1), 
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and so, if we put F, := k(U,Z,), then [F, : k(U)] = 4 and F,(X) = F. In 
particular, since [fF : k(X)] = 4, we see that Fi MN k(X) = k(U) and that 
[fs Fy] = 2: 

Next, let V = Z?/(U — 1). Then by the above equation V? = U, so ie 
V(U —1) = V(V? — 1). From this we see that k(V) = k(U, Z?), so ae 
k(V)] = 2, and hence F, = k(V,Z,) ~ «(£,). In addition, it follows that 
k(V, X) = F’ because b” is the unique proper intermediate field of F/KV), 
In addition, let us observe that 


(U — ?)(X? — 1) = cX(X — a) — P(X? - 1) = (X — 3), 


because (c — 6*) = 1 and ca = 26 by hypothesis. Therefore, if we put 
X, = (X — 6)/(1 — bX) and Z, = ZX), then X? = (U — b?)/(U — 1) and 
Z} = V(U —1)-(U — &)/(U — 1) = V(V? — B?). From this we obtain for 
Fy := k(V, Zz) that [F : k(V)] = 2 and that Fy ~ x(Ey:). We therefore see 
that Fy, fo, F",k(V) and k(U) fit into the above field diagram as indicated, 
and that Fy ~ «(£,) and Fy ~ «(F,2). 


We can now determine the structure of the reduction Ag = A; of the abelian 
variety A = JZ at v. 

Proposition 5.21 Assume again that a = ats andc = 1+ 8? for some 
bE k*, and let d=c/g(a). Then: 


(a) There is a k-isogeny p : Ag + Ag := EF, x Ey x Eg of 2-power degree, and 
hence Ag is k'-isogenous to EB, x Ey x Ey, where k! = k(V/b, Wd). Moreover, if 
oo € Aut(Ao) denotes the automorphism induced by a generator o € Aut(r) 
of the covering group, then we have yo 09 = 040 for some af € Aut(A}). 


(b) Let S, < Ao[p] be an eigenspace of oo, where p = 1 mod 4 is a prime. 
Then every cyclic k-isogeny of Ag with kernel contained in p(S,) is an en- 
domorphism of Al. In particular, if k' = k, then every k-isogeny of Ao with 
kernel contained in S,, is defined over k. 


Proof. (a) Since A = JZ and FE = Jéi*, we have an isogeny y,: E x A— 
J = Jo of 2-power degree (cf. Corollary 5.3) whose reduction 7, : E x Ap @ 
J; is a k-isogeny of the same degree. Moreover, by Proposition 5.18 we 
have an isomorphism y2 : J; — Jo x Ez, and by Lemma 5.20 we have a 
k-isogeny y3 : Jo —~ E x FE, x Ey of 2-power degree. We thus see that 
y= (y3 x tdg,)9 9209, : Ex Apa Ex Eb, x Ey x Ey = Ex Aj is 
an isogeny of 2—power degree, and so the first assertion follows (by taking 
= $)10}x4,) once we have shown that ~({0} x Ap) = {0} x Aj. 

To prove this, it is enough to find automorphisms og € Aut(Ag) and o@ € 
Aut(Ag) with of = [—1]4, and (04)* = [—1].4, such that 


where 49 = [—1]g X oo and 6}, = [—1]x x of. 
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eed, since 1+0o is an isogeny of Ap (because (1+09)(1—a0) = 1—[-1]4, = 
Hao) we see that (1 + ([-l]z x 90))(E x Ao) = {0} x Ao, and similarly 
1404)(E x Ay = {0} x Ao, and hence ~({0} x Ao) = G((1 +50 )(E x Ag)) = 
4 4G), )O(E x Ao) = (1+ a9)(E x AQ) = {0} x Aj, as desired. 
construct 09 and a with these properties, let a7 denote the automorphism 
; of order 4) on the Jacobian J induced by o € Aut(7). Then oj maps the 
subvarieties E and A of Je into themselves, and (o7)\¢ = [—1]z. Thus, if 
ga = (Fs)\As then we have y; 0 ({-1] x o,4) = oj;041. Thus, if o (resp. 
gp) denotes the automorphism induced by oy (resp. 04) on the reduction Jj 
(resp. on Ao), then we also have , 0 ([-lJz x a0) = 0 0 By. 
Next we note that the proof of Proposition 5.18 shows that o induces au- 
tomorphisms oy and o¢, on Ey and on Co respectively, from which we see 
that 9205 = (a7, X G1) ° 2. Similarly, the proof of Lemma 5.20 shows that 
930 Th) = ((-l]ez x o1 X oy2) 0 Ys, for certain automorphisms o; € Aut(£;) 
where 7 = 1,67. We thus see that if we put og = 01 X o42 X 04 € Aut(A5), 
then the above commutation relation holds. 
It remains to show that of = [—l],, and that (04)? = [-1]4:. The latter 
is clear, for by construction of = 01 X oy X oa, where (for 1 = yO 5G) 
the automorphism o; € Aut(£;) has order 4 and so o? = [-1]z, because 
char(k) # 2,3. To prove the former, recall that A = €newJ, where Ene, = 1— 
07, so 03 acts on A like [—1]4. Thus, 0% = [—1]4 and hence by functoriality 
we have o2 = [—1],,, as desired. 
(b) Write FE, = Ey2 and E3 = Ey, so Ay = Ey, x E, x E3. Let S) = y(S,) < 
Aj|p], which is an eigenspace under the action of 09 = a1 X 02 X 03. 
Since Y/—1 € k and j(£;) = 1728, we have for 2 = 1,2,3 that End(£;) D 
Z[/—1], and equality holds unless F; is supersingular (i.e. unless char(k) = 
3 mod 4). Thus, since p= 1 mod4, there is an a; € End(£;) of degree p such 
that Ker(a;) = S Ne;(E;), where e; : FE; — Ag is the embedding of F; as the 
i-th factor of Aj. Thus, S$) = Ker(a;) x Ker(a2) x Ker(as). 
Now since E and FE; are twists of £,, there are k’-isomorphisms f,; : Ey — 
E;, for 1 = 2,3, which we can choose such that f1;0 a, = a; 0 fy;. Thus, if 
P, € E,(k) is a generator of Ker(a;), and P. = f,,(P,) then {P,, P2, P3} isa 
basis of hs where P; := e;(P,). 
Now let A be a cyclic k-isogeny of Aj with Ker(h) < oe Then Ker(h) is 
generated by P = aP; + bP, + cP3, for some a,b,c € F,. W.l.o.g. we may 
assume that a # 0 (otherwise we interchange the roles of F,, Ez and £3), and 
hence that a = 1. Then the k’-endomorphism h’ : Aj — Ab, defined by the 
{ (1—d)o, agi 0 \ 
Matrix | —bfie dg, 0 has kernel Ker(h’) = (P, + 6P)+c¢P3) = 
\ —¢efis 0 ide, / 
Ker(h), which proves the first assertion. 
From this the second assertion is immediate. Indeed, if k’ = k, then by what 
Was just shown, every k-subgroup of Si is k-rational, and hence the same is 
true for es 
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We can now summarize the results obtained above in the following theorem 


Theorem 5.22 Suppose k contains a primitive fourth root of unity, and les 
: 


c= ere for some 6 € k* with 61 £1. Let K = k(t,s) be the function fielg 


of any curve of the form 
sf = u(t? 1)(t— a)g(t), 


where g(t) € k[t] is any polynomial which does not vanish at {0,1,—1,a}, ang 
which has been normalized in such a way that (1+ 6*)/g(a) is a fourth power 
in k. Moreover, let p be the unique place of K/k inducing the specialization 
tt> a. Then the k-rational geometric fundamental group of K with base point 
p is infinite; more precisely, for every prime p = 5 mod 12 (withp # char(k)), 
there is an unramified extension K,/K with Gal(K,/K) ~ PSL3(p) in which 
p splits completely. 


Proof. It is clearly enough to prove the last statement. For this, let C be 
the curve defined over k(t) by the equation Y? = (1 + b4)X(X? — 1)(X ~ 
a)°(X — t)*, and let J denote its Jacobian and A = J"*” its new part (cf. 
section 5.1). Since the hypotheses on a, c = 1 + 6? and g ensure that k’ := 
k(Wb?, * c/g(a)) = k, it follows from Proposition 5.21a) that the reduction 
Ao of A@ K at p is k-isogenous to Ey x Ey x Ej. 

Now let p = 5 mod 12 be any prime (and # char(k)), and let S, < Alp] denote 
an eigenspace of the automorphism oy (cf. proof of Proposition 5.21). By 
Theorem 5.5(b),(d) (and Abhyankar’s Lemma) we know that K, := K(P(S,)) 
is unramified over K and that Gal(K,/K) = PSL3(p). Let S, denote the 
image of S, in Ao{p] under the reduction map at p; clearly, S, is identical to 
the set S,, as defined in Proposition 5.21(b). Thus, by that proposition, con- 
dition (x*) of Corollary 5.16 holds, and so it follows that p splits completely 
in K,. 


Remark 5.23 The simplest examples of fields AK satifying the hypothesis of 
Theorem 5.22 are clearly the fields K = k(s,t), where 


ee 4) 4/42 b? 
sf = (14 54)t(2 —1) (t- 2) 
and 6 € k*™ is any element with 64 # 1; recall that fields of this type were 
studied in Lemma 5.20. Clearly, all these fields have genus 3. More generally, 
by choosing g(t) suitably, we can obtain examples of fields of any genus g > 3 
which satisfy the hypotheses of Theorem 5.22. 
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Embedding Problems and Adding Branch Points 


David Harbater* 


Abstract: If Y — X is a G-Galois branched cover of 
curves over an algebraically closed field k, and if G is a 
quotient of a finite group [, then Y — X is dominated 
by a [-Galois branched cover Z — X. This is classi- 
cal in characteristic 0, and was proven in characteristic 
p by the author [Ha6] and F. Pop [Pol] in conjunction 
with the proof of the geometric case of the Shafarevich 
Conjecture on free absolute Galois groups. The resulting 
cover Z — X, though, may acquire additional branch 
points. The present paper shows how many new branch 
points are needed, and shows that there is some control 
on the positions of these branch points and on the inertia 
groups of Z — X. 


Section 1. Introduction and survey of results. 


This paper concerns an aspect of the fine structure of the fundamental group 
of an affine curve U/ over an algebraically closed field k of characteristic p. 
In [Ra], [Ha3], it was shown which finite groups G are quotients of 7,(U) 
— namely, according to Abhyankar’s Conjecture, the set of such G depends 
only on the pair (g,m), where g is the genus of the smooth compactification 
X of U and n is the number of points in X — U. But the structure of the 
profinite group 7,(U) remains a mystery, even in the case of the affine line. 
Moreover, the group 7(U) (unlike the set 14(U) of its finite quotients) 
does not depend just on (g,n) (cf. [Ha4, §1], [Ta, Thm. 3.5]), though it is 
unclear how it varies in moduli. In the current paper we study the structure 
of 7;(U) by investigating how the finite quotients of this group fit together, 
and how 7(U) grows as additional points are deleted. 

A preliminary result in this direction appeared in [Ha6], [Pol], in con- 
nection with proving the geometric case of the Shafarevich Conjecture. 
Namely, it was shown there that the absolute Galois group Gx of the func- 
tion field K of U is a free profinite group (of rank equal to the cardinality 
of k). This was done by showing that every finite embedding problem for 
K has a proper solution, i.e. that if [.—G is a surjection of finite groups, 
then every unramified G-Galois cover V — U of affine curves is dominated 
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by a T-Galois branched cover W — U. In fact, the proof (which used Patch. 
ing techniques in formal or rigid geometry) showed a bit more — Viz. it 
bounded the number of branch points of W — U. That bound was Not 
sharp, however, and here we obtain the sharp bound (Theorem 5.4 below), 

More precisely, let G=T/N, let p(N) be the subgroup of N generated 
by its p-subgroups (so that N = N/p(N) is the maximal prime-to-p quotient 
of N), and let r be the rank of N (i.e. the minimal number of elements in 
any generating set). In [Ha6, Theorem 3.5] it was shown that W — 1 as 
above can be chosen with at most r+1 branch points; and it was asked if it 
is always possible to choose W — U with at most r branch points. (It is not 
in general possible with only r—1 branch points even in characteristic 0, as 
topological considerations show; and that implies the same for characteristic 
p.) In [Pol], it was shown that this is always possible in the case that r — 0, 
thus answering [Ha6, Question 3.7]. So if N above is a quasi-p group (i.e. 
is generated by its p-subgroups, or equivalently if N = 1) then the cover 
W — U can be chosen to be unramified. 


Here we show that for arbitrary r (not just r = 0), the dominating 
cover W — U can be chosen with at most r branch points (where as above, 
r = rk(N)). In fact, we show a bit more. Namely, for finite group T 
and normal subgroup N of I, we will define the relative rank of N in T, 
denoted rkp(N). This will be a non-negative integer that is < rk(N) (but 
is often strictly less). What we will show is that in the above situation, the 
cover W — U can be chosen with at most rkp(N) branch points, where 
T= I'/p(N). By using the r = 0 case, the proof of this result is reduced 
to the case that N is of order prime to p; and there we use methods of 
patching and lifting. In addition, we show that there is often control over 
the positions of the new branch points, and over the inertia groups of the 
resulting cover (cf. Props. 3.3, 3.5, 4.1, 5.1). 

The results in this paper can be phrased in the language of embedding 
problems. This and other group-theoretic notions (along with some notions 
about covers) are discussed in Section 2. Then, in Section 3, we use formal 
patching to prove the above result in a key special case (when N has order 
prime to p, and one of the branch points of Y — X is tame, where Y > X 
is the smooth compactification of V — U). In Section 4, we use a lifting 
result of Garuti [Ga, Theorem 2] to prove the above result in the case 
that rkp(N) < 1, again assuming that N has order prime to p. Section 5 
combines the two special cases, and applies Pop’s result [Pol] in the case 
r =), to prove the full result (Theorem 5.4). 


[I would like to thank Claus Lehr, Rachel Pries, Kate Stevenson, and 
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Ection 2. Notions concerning covers and groups. 
. ‘this paper we work over a fixed algebraically closed field k of characteristic 
|, > 0, and consider covers of k-curves. A cover will be a finite generically 
separable morphism Y — X of k-schemes, where X is connected. If G is a 
fnite group, then a G-Galots cover consists of a cover Y — X together with 
, homomorphism p : G — Aut(Y/X), such that G acts simply transitively 
on any generic geometric fibre of the cover, via p. (The top space Y is not 
required to be connected. For example, the trivial G-Galois cover of X is 
a disjoint union of copies of X indexed by the elements of G, on which G 
acts by the regular representation.) 

Since k is algebraically closed of characteristic p, there is a primitive 
mth root of unity ¢,, € & for each positive integer m not divisible by p. Here 


/ 
m — 


we may choose the elements ¢,, to be compatible; i.e. such that (7, = 


Gm for all m.m'. From now on, these will be fixed. For any G-Galois 
cover 7 : Y — X of smooth connected k-curves and any tame ramification 
point 7 lying over a branch point € € X, the corresponding extension of 
complete local rings is given by y” = x, for some choice of local parameters 
r,y. The inertia group is generated by c : y > Gmy, and the element 
ec € G (which is independent of the choice of local parameters) is called 
the canonical generator of the inertia group at 7. (Here and just below, 
we follow the terminology of [St] and {HS, §§2,3].) If all the branch points 
of Y — X are tame, and if the branch points are given with an ordering, 
say £;,...,&,, then we say that the cover has description (c1,...,¢-), where 
¢; is a canonical generator of inertia at a point over €;, and where each 
¢; is determined up to (individual) conjugacy. In the case that k = C 
and ¢,, = e°7'/™, the fundamental group of U = X — {&,...,€-} has 
presentation 
g r 
Pe Sa je OG Oi poiy OB Cine [] (a. od T] ere 1). '(#) 
i=1 jel 

Where g is the genus of X. Here the G-Galois cover corresponding to a 
Surjection @ : 7(U) — G has description (4(c1),...,@(c,)). If p does not 
divide the order of G, then this also holds for an arbitrary algebraically 
Closed field k of characteristic p > 0, via the same presentation (+) of the 
Maximal prime-to-p quotient 7(U)? of 74(U) [Gr, XIII, Cor. 2.12]. (This 
Presentation arises via a specialization morphism between k and C, which 
Should be chosen so that the given roots of unity ¢, € k& correspond to 
e@ti/m © ©. Cf. (Gr, XIII] and [GM, Thm. 4.3.2, Lemma 4.1.3].) 
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Consider more generally a G-Galois cover Y — X of a semi-stabj. 
k-curve X (i.e. X is connected and its only singularities are nodes). Let 
Y — X be the pullback of Y — X over the normalization X of X. We Say 
that Y — X is admissible if for each singular point 7 € Y, the canonical 
generators at the two points 7;, 2 € Y over 7 are inverses in G. A thickenin 
of Y — X is a G-Galois cover of normal k[[t]]-curves Y* — X* whose closeq 
fibre is Y — X, and whose completion along the smooth locus of Y is a 
trivial deformation of its closed fibre. If Y — X is an admissible COVey, 
then such a Y* — X" is called an admissible thickening of Y — X if at 
the complete local ring at every singular point of Y the cover is given by 
the extension k[[t, 21, r2]]/(xit2—t™") — k[[t, y1, yol]/(yiy2 —t) for some m 
prime to p, where x; ++ y!” under the inclusion, and where an associated 
canonical generator of inertia acts by r+ Gnz, y+ Gly. Observe that ip 
this situation, the singular points of the closed fibre X are isolated branch 
points of Y* — X™* (and this does not contradict Purity of Branch Locus 
since X* is not regular and Y* is not flat over X*). Since these points 
are branch points of the irreducible components of Y — X, the process of 
constructing an admissible thickening can be regarded as a way of patching 
together these components in such a way that some of the branch points 
“cancel” on the general fibr! e (and cf. [HS, Thm. 7]). This ob servation will 
be key to the results of §3 below, and thus to the paper’s main theorem, by 
yielding a cover with fewer branch points than would otherwise be expected. 


The remainder of this section is devoted to discussing some group- 
theoretic notions that will be used in this paper. 

If f is any finite group, then (following {[FJ]), we define its rank to be 
the smallest non-negative integer r = rk(T’) such that T has a generating 
set of r elements. (In the literature, this integer is also sometimes denoted 
by d(I).) More generally, let FE be a subgroup of a finite group I’. A subset 
S Cc E will be called a relative generating set for E in T if for every subset 
T CT such that EUT generates I’, the subset S UT also generates [. We 
define the relative rank of F in IT to be the smallest non-negative integer 
s = rkp(&) such that there is a relative generating set for EF in T consisting 
of s elements. Thus every generating set for E is a relative generating set, 
and so 0 < rkp(£) < rk(#). Also, rkp(#) = rk(£) if E = 1 or E =T, while 
rkp(£) = 0 if and only if E is contained in the Frattini subgroup (T) of I. 

A related notion is the following: Let G be a subgroup of a group I. 
A subset T CT is a supplementary generating set for T with respect to G 
if T UG generates [. Suppose that [ is a finite group that is generated 
by two subgroups E,G. We then define the relative rank of E CT with 
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ct to G to be the smallest non-negative integer t = rkp(#,G) such 
there 1s a supplementary generating set 7’ for T with respect to G, 
7 of cardinality t and 7’C E. Note that every relative generating set 
£ in I is a supplementary generating set for [ with respect to G. So 
rkr(E, G) < rkp(E) < rk(£). 

If p is a prime number, then the quas?-p part of a finite group T is the 
pgroup p({) CL that is generated by the p-subgroups of I (or equiva- 
jently, by the Sylow p-subgroups of [). Thus p(T) is a characteristic sub- 
group of f, and in particular is normal. A group I is defined to be a quasi-p 
group if f = p(f). Thus for any finite group I, the subgroup p(T) is a quasi- 
p group and ['/p(T) is the (unique) maximal quotient of [ whose order is 
not divisible by p. (In the other case, viz. p = 0, we set p(T) = 1.) 


If 11,1, H are groups (not necessarily finite), then an embedding problem 
for II consists of a pair of surjective group homomorphisms € = (a: Tl — H. 
f:°— #). A weak solution to the embedding problem consists of a group 
homomorphism ( : I] —T such that f3 = a. If moreover @ is surjective, 
then it is referred to as a proper solution to the embedding problem. An 
embedding problem is finite if [ is finite. The motivation for the notion of 
embedding problems comes from Galois theory: If AK C L is a Galois field 
extension with group H, and if II is the absolute Galois group Gx of K, 
then Galois theory yields a corresponding surjection a: Gn-—»H. Let f : 
[+H be a surjective homomorphism of finite (or profinite) groups. Then 
a proper [resp. weak] solution to the embedding problem (a, f) corresponds 
to a I-Galois fiel! d extension of K [resp. to a Gamma-Galois K-algebra] 
containing the H-Galois extension L, such that the actions of f and H are 
compatible with the surjection T—-H. That is, the H-Galois extension L is 
embedded in a I'-Galois extension via a solution to the embedding problem. 


Observe that if @ : [I/II is a surjective homomorphism of groups, 
then every embedding problem for II induces an embedding problem for TI’. 
Namely, if € = (a: Il — A, f :T — H) is an embedding problem for II, then 
there is an induced embedding problem €' = (a’ :- Tl’ + H,f :T — A) for Il’, 
Where a’ = a@. Moreover, a weak or proper solution to the given embedding 
Problem induces such a solution to the new problem. On the other hand, 
hot every solution to the new problem need come from a solution to the 
Original problem. These observations will be useful later, when considering 
the fundamental groups II = 7,(U) and II’ = 2,(U’) of two affine curves 
U’ CU. In that context, solutions to embedding problems for II correspond 
to certain unramified covers of U, whereas solutions to embedding problems 
for TI’ are required merely to be unramified over U’. 
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As above, let € = (a: I] ~ H,f : T — H) be an embedding Proble 
for II. If the exact sequence 1 — N —T — H — lissplit, where N = kep 
then we say that € is a split embedding problem. A split embedding Probleyy 
€ = (a, f) always has a weak solution, viz. sa: I] +I, where s isa secti 
of f — H. Often, finding proper solutions to embedding problems cap 
reduced to doing so for split embedding problems — e.g. see [FJ, §20.4 
(Ha3, proofs of Thm. 5.4, Prop. 6.2], [Ha6, proof of Prop. 3.3], and [Pog 
§1B(2)]. For the sake of completeness, we conclude this section with ; 
precise statement of this reduction, in a form that can be cited late; (in | 
sections 3 and 5 below). 
Proposition 2.1. Let € = (a: ll — H,f :T — H) be an embedding . 
problem, and let N = ker f. Suppose that € has a weak solution ao : II + p. 
and let Hy C IT be the image of ag. Consider the semi-direct product 
To = Nx Apo, with respect to the conjugation action of Hy on N <I, and let 
fo : To—Ho be the natural quotient map. If the split embedding problem — 
Eo = (ao : I] > Ao, fo: Po — Ho) has a proper solution, then so does &. 


Proof. Since ap is a weak solution to €, we have fag = @; or equiva- 
lently fap = a, where fj : Hg — A is the restriction of f : T—H to Ap. 
Since f has kernel AN, and since its restriction f|y, = / 1s surjective onto 
H (because fag = a is), it follows that [ is generated by N and Ao. Let 
we: Po — F be the homomorphism defined by taking the identity inclusion 
on each factor of fg = Nx Hp. (This is a homomorphism since the con- 
jugation action of Hp on N in Tp is the same as the conjugation action of 
Hy on N in YT.) Then p is a surjection since N and Ho generate I, and it 
is straightforward to check that fu = pfo. We thus obtain the following 
commutative diagram (where as above fay = a: II — A): 


I] 


| ———> N — sr —-— > A —_ 1 
So any proper solution $9 : I]—-Tp) to the split embedding problem & = 
(ao : Tl] — Ho, fo : To — Ao) yields a proper solution # : II+I to the 
original embedding problem &, viz. 6 = po. O 
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Jn patticular, the reduction in the above proposition can be always 
mplished in the case that the group II is projective (which by definition 
§20.4] means that every finite embedding problem for II has a weak 
tion). Indeed, in that situation, the given embedding problem € has a 
k solution ao, and so the above hypotheses are satisfied. 


ction 3. Results via patching. 


. is section uses formal patching methods in order to prove the main result 
the paper in a special case. Namely, we consider a finite group T and a 
‘quotient G = I'/N, together with a G-Galois étale cover of smooth affine k- 
‘curves V — U (where, as always, & is algebraically closed of characteristic 
p2 0). We consider the smooth completions X,Y of U,V, and assume 
that Y — X is tamely ramified at some branch point €. We will also 
assume that p does not divide the order of N. In this situation, we will 
show that there is a [-Galois cover W — U dominating V — U, having 
at most rkp(NV) branch points, and with specified inertia groups over those 
points (Prop. 3.5). This solves a certain embedding problem (Cor. 3.6). 
Moreover we will obtain greater control on the number of branch points 
of the constructed cover and on the inertia groups over X — U in the case 
that the embedding problem is split (under an additional assumption on 
normalizers). Cf. Prop. 3.3 and Cor. 3.4. A more general and more precise 
version of these results appears first, in Prop. 3.1 an Cor. 3.2). 

Patching methods, in formal or rigid geometry, have previously been 
used to prove a number of results concerning fundamental groups of vari- 
eties, especially for curves in characteristic p — e.g. [Hal], [Ha2], [Ra,§§3-5], 
[St], [Sa], [Ha6], [Pol], [HS]; see also [Ha5, §2]. The basic idea is to build a 
simpler, but possibly degenerate, cover with similar properties, and then to 
deform it to a family of covers whose generic member is as desired. In order 
to reduce the number of branch points of the cover we construct here, and 
thus achieve the desired sharp bound on that number, we will use a con- 
struction involving admissible covers; cf. §2 above and the remark following 
Proposition 3.3 below. 


Below we preserve the terminology of Section 2, and begin with an 
assertion concerning the problem of modifying a cover so as to expand its 
Galois group. (Cf. also [Ha2, Theorem 2] for a related result.) Note that 
here, and in the next few results, it suffices to use the value rkp(Z,G), 
Tather than having to use the possibly larger value rkp(F). 


Proposition 3.1. Let T be a finite group generated by two subgroups 
G,E, where p does not divide |E|, and let r > rkp(E,G). Let V + U 
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be a G-Galois étale cover of smooth connected affine k-curves with SMOo¢h 
completion Y — X. Suppose that Y — X is tamely ramified over Some 


point € of B = X — U, and that some inertia group over € normalizes E 


Then there is a smooth connected T'-Galois cover W — U having at most » ' 


branch points. 


Moreover, if {e1,...,¢>} C E is a supplementary generating set for ra ’ 


with respect to G, then the above cover can be chosen so that: 

(i) The H-Galois cover W/N — U agrees with V/(NIG) > U, where 
N is the normal closure of E inT and where H =T/N = G/(NNG). 

(ii) There are inertia groups of W — U over the branch points &1,..., ¢, 
having canonical generators €,,...,€r, respectively. 

(iii) Each inertia group of Y — X over any point x € B — {&} is also 


an inertia group of Z —+ X over y, where Z is the smooth completion of W. 


Proof. Let R = k{(t]], let X = X x, R, and let X* be the blow-up of X at 
the closed point of € = £x,R. Thus X* is a regular two-dimensional scheme 
that is projective as an R-curve. Its closed fibre Xo is connected and consists 
of two irreducible components: a proper transform that is isomorphic to X, 
and an exceptional divisor that is isomorphic to P}. These two components 
meet at the point on the proper transform corresponding to € on X, and to 
the point s = 0 on the projective s-line P}. (Here we take s = t/x, where 
z is a local parameter for X at €. Thus the locus of (s = oo) is the proper 
transform of €.) 

Let {e1,...,e,} C E be a supplementary generating set for [ with 
respect to G, and let o;,...,0, be distinct points of Pi other than s = 0, ™. 
By hypothesis we may choose a point n € Y over € € X for which the inertia 
group | C G normalizes FE. Let g € G be the canonical generator of the 
inertia group I. Thus the subgroup Eg CT generated by EF and g is of the 
form Ep = ExT, and hence its order is not divisible by p. Let Ey C Eo be 
the subgroup generated by e),...,é,,g, and let h = (e; ---e,) 1g. Thus p 
also does not divide the order of E;, and g~‘e,+:-e,h = 1. As discussed in 
§2 above (and cf. [Gr, XIII, Cor. 2.12]), there exists a smooth connected 
E,-Galois cover M — Pi branched at 0,01,...,0,,00 with description 


-1 is a canonical 


(g~*,e1,...,€p,h). Let p € M bea point over 0 at which g 
generator of inertia. Consider the induced (disconnected) [-Galois covers 
Ind Y — X and Ind, M — P}, consisting of disjoint unions of copies 
of ¥Y — X and M — Pj, respectively, indexed by the cosets of G and 
of E, in T. We may identify Y and M with the identity components of 
the respective induced covers. Identifying the two points y(7) € Indy Y 


and y(t) € Indb, M for each y € I, we obtain a I'-Galois cover Zp of the 


' 
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ducible curve Xo. Moreover Zp is admissible over Xo by construction, and 
connected since G and E; generate I (and cf. also [HS, §4, Pro! p. 2)). 
By [HS,§2, Cor. to Thm . 2], there is a P-Galois cover Z* — X™* which is 
gn admissible thickening of Z) + Xo (viz., in the terminology of [HS], the 
nique solution to the corresponding relative thickening problem). 

Let Z° — X° be the fibre of 7* — X™* over the generic point of 
Spec k{[t]]. Since X™* is the blow-up of X = X x, k[[t]] at the closed point 
of £, there are isomorphisms of K-curves X° ~ X x, K w X* xq K, 
where K = k((t)). Since Z* — X™* is a thickening of Zp — Xo, the cover in 
particular restricts to a trivial deformation of the restriction of Ind; YX 
to X —{&}. Hence Z° — X° is branched at the points of (B—{&})x;,K, with 
the same inertia groups as the corresponding points of B — {£} for Y — xX; 
and it is branched at no other point of X° except for those whose closure 
in X passes through the point (€,(f = 0)). Among points of the latter 
type, Z° — X° is branched precisely at r+ 1 points of,...,02,00° whose 
closures oj,...,07%,C!nfty* in X* pass through the points o1,...,0,, 00. 
(Note that the singular point of Xo is an isolated point of the branch locus 
of Z* — X”*, as discussed in §2; so it does not contribute to the branch locus 
of Z° — X°.) Over the point o%, the inertia groups of 7° — X° are the 
same as those of Z* — X™* over o*, and one of them has canonical generator 
e;. Here the closure of 00° in X is € = £ x; R. So under the isomorphism 
X x; K = X°, the branch locus consists of the r points 7? and the points 
of B x; K (with € x; K corresponding to the point oo° in X°). Also, in the 
special case EF — 1, the cover Z° — X° is just the base change of Y — X 
from k to K. Since the above construction commutes with taking quotients, 
we deduce for arbitrary © that the cover Z°/N — X° is the base change of 
Y/(NNG) — X from k to K. 

Thus 7° — X° x X x; K is asmooth connected [-Galois cover whose 
restriction W° — U° := U x; K has the desired properties for W. but over 
K instead of over k. Being of finite type, this cover descends to a smooth 
connected [-Galois cover Z4 — X4 := X x, A over some finitely generated 
k-algebra A C K, whose restriction to U4 := U x, A has the corresponding 
Properties over A. Here Spec A is an absolutely irreducible variety, since 
AC K and k is algebraically closed. By [Ha2, Prop. 5] (or [FJ, Props. 8.8, 
9.29]) we conclude that the specialization Z, — X of Z4 — X4 at a k-point 
V € Spec A restricts to a G-Galois cover W := Z, xx U — U having the 
desired properties. O 


Using the notion of embedding problems (cf. §2), we may rephrase 
Proposition 3.1 in more group-theoretic terms. In particular, we have the 
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following corollary. In this connection, we recall that an inclusion U’ < U 
of affine curves induces a surjection 7,(U’)—»7,(U). 


Corollary 3.2. Let X be a smooth connected projective k-curve, let U CX 
be a dense affine open subset, and let € € X —U. Let Il = ™1(U) and ley 


| 


II* be the quotient of II corresponding to covers whose smooth completions 
are tamely ramified over €. Let € = (a: Il* + H,f :T — H) bea finite | 


embedding problem for II", and let 8 be a weak solution to €. Suppose 
that T is generated by G,E CT, where E is a subgroup of ker(f) with D 
not dividing |E|. Suppose also that the normalizer of FE in T contains (I), 
where I C II" is an inertia group over €. Let r > rkp(E, G). Then there js 
an open subset U' C U such that U — U' has cardinality r and the induced 
embedding problem €' for lI’ = 1,(U') has a proper solution. 


Proof. Let N be the normal closure of F in lr. Since EC ker(f), it follows 
that N C ker(f), and H is a quotient of the group Hy :=T'/N = G/(NNG). 
Let f; : TH, and fo : G-»H, be the natural quotient maps, and let 
a, = fof : I*—H,. Replacing € = (a : I* — H,f : T — BF) by the 
embedding problem €; = (a; : II* — Ay, f; : T — Hy), we may assume 
that H = Hj, that f|g = fo, and that a = ff. 

Now #3 is a proper solution to the embedding problem & = (a: II* = 
H, fo: G— H), where fo = flg : G + H. Under the Galois correspon- 
dence, the surjection @ : II*+G corresponds to a connected étale G-Galois 
cover V — U whose smooth completion Y — X is tamely ramified over 
€, and such that some inertia group over € normalizes E. By Proposi- 
tion 3.1, there is a smooth connected [-Galois cover W — U having at 
most r branch points, such that there is an isomorphism of H-Galois covers 
W/N = V/(N MG) of U. (Here, as above, N is the normal closure of E in 
Pr, and H =V/N = G/(NNG).) So over the complement U’ C U of the 
r-point branch locus of W — U, we obtain a T'-Galois étale cover W’ — U' 
corresponding to a proper solution to the embedding problem €’. 0 


Remarks. (a) The above corollary does not rely on the full statement of 
Proposition 3.1, since neither (ii) nor (iii) there are used. But if II’ is re- 
placed by a suitably refined quotient II’/~ (containing additional information 
about inertia groups), then a corresponding result can be proven, with the 
aid of (11) and (iii) of 3.1, about embedding problems for TI’"; and this would 
correspond to the full content of 3.1. 

(b) In the other direction, it would be desirable to state a version of 
Corollary 3.2 just for II, rather than for II*, and without assumptions on 


normalizers. Correspondingly, it would be desirable to state a version of 
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oposition 3.1 without the assumptions on tameness or normalizers. (The 
oof of 3.1 at least shows that it is possible to weaken the assumption 
at an inertia group J over € normalizes E', by instead assuming that EF, I 
generate a prime-to-p subgroup of I.) 

(c) The proofs of the above results, and those that follow in this section, 
require the base field k to be algebraically (or at least separably) closed, 
pecause of the use of [Gr, XIII, Cor. 2.12] in the proof of Proposition 3.1. 
{n particular, the condition that k be large (cf. [Po2]) does not suffice, at 
Jeast for the proofs here. See also Remark (b) at the end of Section 4 below. 
In particular, in the split embedding problem situation, the above re- 


sults give rise to the following proposition and corollary: 


Proposition 3.3. Let I be a finite group of the form NxG, where p does 
not divide the order of N, and let {n,,...,n-} C N be a supplementary 
generating set for T with respect toG. Let V — U ke a G-Galois étale cover 
of smooth connected affine k-curves whose smooth completion Y — X is 
tamely ramified over some point € of B = X —U. Then there is a smooth 
connected T'-Galois cover W — U branched only at r points €),...,&, with 
smooth completion Z — X, such that: W/N x» V as G-Galois covers of 
U; the element n; is the canonical generator of an inertia group of W — U 
over €;; Z —> X is tamely ramified over €; and each inertia group of Y + X 
over any point x € B — {£} is also an inertia group of Z — X over x. 


Proof. Since N is normal in I, any inertia group of Y — X over € must 
normalize N. So Proposition 3.1 applies, with EF = N, and with the H 
of Proposition 3.1 being the same group as G here. This yields the result 
(with tameness over € following since Z/N = Y and p does not divide the 
order of N). O 


Remark. In the special case that the cover Y — X has trivial inertia 
Stoups over € (so that the given tamely ramified point is not actually a true 
branch point), the assertion of Proposition 3.3 is closely related to [Haé, 
Theorem 3.5] (by taking the point €9 of [Ha6, Theorem 3.5] to be € above), 
and the proofs are also related. But in the general case, the assertion of 
[Haé, Theorem 3.5] is weaker than Proposition 3.3 above, since it requires 
an extra branch point (beyond the r points in Prop. 3.3). The difference is 
that in the result above, admissible covers can be used to avoid adding the 
€xtra branch point, provided that we have a tameness assumption. (The 
Tesult in [Ha6] also uses a weaker notion of rank.) 


Corollary 3.4. Let X be asmooth connected projective k-curve, let U C X 
be a dense affine open subset, and let € € X —U. Let Il = ™(U) and let 


130 HARBATER: Embedding problems and adding branch points 


II" be the quotient of II corresponding to covers whose smooth completion. 
are tamely ramified over €. Consider a finite split embedding problem ‘iad 
(a: T* —G,f:T — G) for II’, such that p does not divide the order of 
N = ker(f). Let r > rkp(N,«(G)), where 1: GT is a section of f. They 
there is an open subset U' C U such that U — U’ has cardinality r and tae 
induced embedding problem €' for II! = 7(U') has a proper solution, 


Proof. Identifying G with its image under t, we may identify [ with the 
semidirect product NxG. By the assumption on rank, there is a supple. 
mentary generating set {n,,...,n,-} C N for T with respect to G. Also, the 
homomorphism a : II*-+G corresponds to a G-Galois connected étale Cover 
of affine k-curves whose smooth completion Y + X is tamely ramified over 
€. So the hypotheses of Proposition 3.3 are satisfied, yielding a T-Galois 
cover W — U that is étale over some U’ C U with U — U’ of cardinality r, 
This cover corresponds to a homomorphism [I’—*T that is a solution to the 
induced embedding problem &’. oO 


Remarks. (a) The proper solution to €' in 3.4 is automatically a proper 
solution to the induced embedding problem for II’*, the quotient of II’ cor- 
responding to covers of U’ whose smooth completions are tamely ramified 
over €. As in 3.3, this is because p does not divide the order of N. 


(b) Corollary 3.4 can also be deduced directly from Corollary 3.2, by 
taking H =G, N = E, and # =a. 


As discussed in Section 2, results about split embedding problems for a 
group II can sometimes be extended to results about arbitrary embedding 
problems for II, e.g. in situations in which the group II is projective. By [Se?2, 
Proposition 1], the fundamental group of an affine k-curve has cohomological 
dimension < 1; and hence it is a projective group [Sel, 1.5.9, Prop. 45]. 
Using this projectivity, we obtain the following variant of Proposition 3.3 
that applies even in the non-split case. It does, however, provide a bit less 
control on the number of punctures needed (and cf. Remark (c) after the 
proof of Corollary 3.6 below). 


Proposition 3.5. Let I be a finite group, let N be a normal subgroup 
of order prime to p, and let G = T'/N. Let {nj,...,n-} C N be a rel- 
ative generating set for N in T. Let V — U be a G-Galois étale cover 
of smooth connected affine k-curves whose smooth completion Y — X is 
tamely ramified over some point £ of B = X —U. Then there is a smooth 
connected I-Galois cover W — U branched only at r points £,,...,£&, such 
that W/N x» V as G-Galois covers of U; n; is the canonical generator of 
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inertia group over €;; and the smooth completion of W — U 1s tamely 
‘amified over €: 


proof. The fundamental group II := 7)(U) is a projective group, since 

cd(Il) < 1 [Se2, Prop. 1}). So the surjective homomorphism II—+G corre- 

sponding to V — U lifts to a homomorphism II — I, say with image Go. 

“Tet Vo — U be the Gog-Galois cover corresponding to this lift. Thus we have 
an unramified N 1 Go-Galois cover Vo — V. Let Yo be the smooth comple- 
tion of Vo. Then p does not divide the degree of Yo — Y, since that degree 
divides |N|. Since Y — X is tame over €, it follows that so is Yo — X. 
Moreover any inertia group of Yo — X over € must normalize N, since N 
js normal in I’. 

Since Gg —»G =I/N, the group I is generated by N and Go. Hence [ 
js generated by nj,...,n, and Go; ie. {n1,..., nr} C N is asupplementary 
generating set for T with respect to Go. So by Proposition 3.1, there is a 
[T-Galois cover W — U having at most r branch points &),...,&- such that 
the G-Galois cover W/N — U agrees with Vo/(NMGo) — U, and such that 
n; is the canonical generator of an inertia group of W — U over &;. Since 
Vo/(N Go) is isomorphic to V as a G-Galois cover of U, and since p does 
not divide the order of N, it follows that W — U is as desired. O 


Corollary 3.6. The assertion of Corollary 3.4 carries over to finite embed- 
ding problems that are not necessarily split, provided that one instead takes 
r>rkp(N). 


Proof. Since r > rkp(N), there is a relative generating set {n1,...,n7} C N 
for N in T. The proof then proceeds parallel to that of Corollary 3.4, but 
using Proposition 3.5 instead of Proposition 3.3. 0 


Remarks. (a) Remark (a) after Corollary 3.4 carries over as well to Corol- 
lary 3.6. 

(b) Corollary 3.6 can also be proven by applying Proposition 2.1 to 
Corollary 3.4. This uses that II is projective; that rkp(N) > rkp(N, Go) 
(where Go is as in the proof of 3.5); and that the Go-cover Yo — X is 
tamely ramified over € (as in the proof of 3.5). 

(c) As mentioned above, 3.3 and 3.4 apply only to split embedding 
Problems, whereas 3.5 and 3.6 apply more generally to embedding problems 
that need not be split. But in the process of reducing to the split case, we 
obtain weaker conclusions in 3.5 and 3.6 than in 3.3 and 3.4 (though under 
More general hypotheses). Specifically, different notions of generators and 
tank are used in the two pairs of results, and the notion of rank in 3.5 
and 3.6 will typically be larger (when both make sense). The need for these 
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variant notions here is due to the fact that one does not in advance know the 
group Go C I that arises in the proof of 3.5 (and indirectly, in 3.6), in the 
process of reducing to the split case. Another way in which the generalizeg 
hypothesis leads to a weaker conclusion here is that one no longer has the 
same control on the inertia groups over B — {€} in the situation of 3.5 thay 
one had in 3.3. This is because the map Go — G in 3.5 need ! not be 
an isomorphism, and because one does not know a priort which choice of 
Gy) CT over G will be needed in the construction. 

(d) In the results in this section of the paper, it would be desirable 
to prove that the positions of the new r branch points can be specified in 
advance. In a related situation, such a result with control on the addj- 
tional branch locus appears in Section 4 below. But there, unfortunately, 
connectivity cannot always be guaranteed. 


Section 4. Results via lifting. 


In this section another special case of the main theorem in proven, by means 
of lifting to characteristic 0. As before in Proposition 3.5, we have a finite 
group T and a quotient G = T'/N, and a G-Galois étale cover of smooth 
affine k-curves V > U. And as before, the problem is to show that there 
is a P-Galois cover W — U dominating V — U, having at most rkp(V) 
branch points with specified inertia there, under the assumption that N 
has order prime to p. But unlike the situation of the previous section, we 
need not make any tameness assumption here on the smooth completion 
of V — U. What is shown here (Prop. 4.1) is that if the relative rank 
rkp(N) is at most 1 (or if p = 0), then such a connected W exists, and 
moreover that the position of the extra branch point can be given in advance. 
(On the other hand if rkp(N) > 1 and p > 0, then we still obtain a W 
with specified branch locus and inertia, but conceivably it might not be 
connected.) Thus if rkp(N) < 1 or p = 0 then we can obtain a proper 
solution to the corresponding embedding problem (Cor. 4.2). 

The method of lifting and specializing to characteristic 0, in order to 
study fundamental groups in characteristic p, was used by Grothendieck 
(cf. [Gr], [GM]) in the situation of the tame fundamental group — with 
the strongest conclusions obtained on the maximal prime-to-p quotient of 
m1. The idea is to work with a mixed characteristic complete discrete val- 
uation ring R, whose residue field is the given algebraically closed field & 
of characteristic p. By using the knowledge of 7; in characteristic 0, one 
can construct a cover over the general fibre; close this up over R; and then 
specialize to the closed fibre to obtain a cover over k. The main difficulty 


in extending this method to more general covers is that the restriction to 
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ne closed fibre may be inseparable over the generic point or it may have 
wild ramification there. Nevertheless, in [Ra, §6], Raynaud was able to use 
this method, in conjunction with a careful analysis of r! amification along 
the closed fibre of a semistable model, in order to construct covers of the 
affine line in characteristic p in a key case (and thereby complete the proof 
of Abhyankar’s Conjecture for A). 

In addition to the problem of specializing characteristic 0 covers to 

characteristic p, there is also the problem of lifting a given characteristic 

cover to characteristic 0. Again, this was done by Grothendieck in the 
tame case ({Gr], [GM]). In the wild case, this is not in general possible, since 
some characteristic p curves violate the Hurwitz bound on the number of 
automorphisms that a curve of genus g can have. But M. Garuti has proven 
a modified lifting result, which will be sufficient for our purposes. Namely, 
he has shown [Ga, Thm. 2] that if we are given a G-Galois cover Y — X 
over k, and if a lift X* of X to R as above is given, then (possibly after 
enlarging Ff) there is a normal G-Galois cover Y* — X™* over R whose closed 
fiber Y;" — X is closely related to Y — X. Specifically, Y,* is an irreducible 
curve whose only singularities are cusps over wildly ramified branch points 
of Y — X, and Y is the nor! malization of Y/. 

Using Garuti’s result to lift, followed by a construction in characteris- 
tic 0 and then descent to characteristic p, we obtain the following version 
(Proposition 4.1) of the main theorem of the paper. Note that in the proof, 
after constructing a I-Galois cover W* in characteristic 0, we do not in gen- 
eral know that its closed fibre IVf is irreducible. So instead we will choose a 
suitable irreducible component W of W;, which will be ['’-Galois for some 
cr. But if rkr(N) < 1 then Wy will in fact be irreducible, and so we 
will have I’ = T in this special case. 

We state the result in a slightly more general form, in which we specify 
in advance the extra inertial elements n;,...,n,, but do not require them to 
constitute a relative generating set for N. In this generality we still obtain 
a T-Galois cover with the desired properties except for connectivity (and so 
the Galois group of a connected component will be a subgroup of F'). But 
when the n; form a relative generating set, and r < 1 or p= (0, then we do 
obtain connectivity (cf. part (c) below). 


Proposition 4.1. Let T be a finite group, let N be a normal subgroup of 
order not divisible by p, and let G=T/N. Let S = {n,,...,n,} be a finite 
subset of N, with r > 0. Let V — U be e G-Galois étale cover of smooth 
connected affine k-curves, and let €,,...,€- € U be distinct points. 


a) Then there is a subgroup I’! C l and a smooth connected ['-Galois cover 
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W — U branched only at &1,...,&-, such that T = NI’ and n, € [VY (if 
r > 1); W/(N NI") is isomorphic to V as a G-Galois cover of U; and the 
canonical generator of each inertia group over €; is conjugate to n; in r. 
with n; being equal to the canonical generator of some inertia group ove; 
Carrs lore =0. 

b) Ifr < 1 or p=0 then we may also require that S CT’. 


c) If S is a relative generating set for N in T, and if either r < 1 or p=. 
then we may take I’ =T. 


Proof. Let Y — X be the smooth completion of V — U. Let B= X ~— U, 
which is a non-empty finite set of |B| pomts. Let B’ = {&,...,&-}, which 
is a subset of U; and let U’ = U — B’. There are two cases to consider: 


Case A: p— 0. 

(a), (b): Let g be the genus of X and let n = |B|. Thus n > 1. By 
(Gr, XIII, Cor. 2.12], the fundamental group 7(U) is generated by elements 
@1,...,4@g,61,...,b9,¢1,...,Cn subject to the presentation (*) of §2 above. 
Similarly, 7;(U’) is generated by elements @1,...,@g, b,, ee by, ee ome oe 
subject to the analogous presentation (*)’. The natural map 71(U')—»7(U) 
takes Gj + aj, bj H bj, GH; forl<j<n,and¢e;+lforn<jcntr. 

The G-Galois étale cover V — U corresponds to a surjective group 
homomorphism ¢:7({U) > G. Let a; = ¢(a;) and 8; = $(6;) for 1 <i <g, 
and let y; = ¢(c;) for 1 < j <n. For each such 7 and 7 choose elements 
ai, Bi, 7; € lover a3 0577; €°G. Alsogletyes =; EN for ls 
Then there is a unique homomorphism ¢ : ™(U’) + T given by (a) gia re 
(b;) = 3;, and o(¢;) = 7; forl1<j<n+r. (The image of ¢c, is uniquely 
determined by the single relation (*)’ for 7;(U') and by the requirement 
that @ be a homomorphism.) Let I” be the image of ¢. Thus S c I”. 
Also, ¢ lifts the surjection ¢, and so NI’ = [. Thus @ corresponds to 
a connected I’-Galois étale cover W — U that dominates V — U. The 
smooth completion Z — X of W — U has description (¢(é1), ee $(En+r)) 
(cf. §2 above). In particular, n; = b(En+;) € N is a canonical generator of 
inertia above €; for 1 < j <r. This cover is thus as desired, for parts (a) 
and (b). 

(c): By part (b), I’ contains S. But S is assumed to be a relative 
generating set for N. Since NI’ =—T it follows that I’ =[T. 


Case B: p > 0. 

(a): Let R be a complete discrete valuation ring of mixed characteristic 
and residue field k (e.g. the Witt ring over k). By [Gr, III, Cor. 7.4], there is 
a smooth complete R-curve X* whose closed fibre is X¥. By [Ga, Thm. 2], it 
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Jlows that after replacing R by a finite extension (and X” by its pullback 
> that extension), there is an irreducible G-Galois cover Y* — X™" of 
‘proper At-curves with Y* normal, such that Y 1s the normalization of the 
closed fibre Y,° of Y*, and such that Y; is everywhere unibranched, with 
the morphism Y — Y;,' being an isomorphism away from the wildly ramified 
points of Y. (Here the residue field of the enlarged F is still k, since k is 
algebraically closed.) 

Suppose that R C T'is a finite extension of complete discrete valuation 
rings, and let Yp — X7 be the normalized pullback of Y* — X* from R 
to T. Thus Yj is the normalization of Y“ x x+ X7. Now the closed fibre of 
y* — X™* is generically étale, but the closed fibre of X7; — X™ is totally 
ramified. So the irreducible schemes Y* and X7 dominate no common non- 
trivial covers of X*. But Y* and X} are normal; so the intersection of 
their function fields is that of X*. Also, since Y* is Galois over X”, the 
corresponding extension of function fields is also Galois. So by [FJ, p. 110], 
the function fields of Y and X*> are linearly disjoint over that of X“. Thus 
Y* xx- X>7 is irreducible and hence so is Yp . The conclusion is that the 
generic fibre Y° of Y* is geometrically irreducible. 


Since X™* is regular and Y* is normal, Purity of Branch Locus applies 
to Y* — X* [Na, 41.1]; and since the closed fibre is generically étale, it 
follows that the branch locus B* defines a cover of Spec R. Also, since 
X* — Spec R is smooth, it follows from [Gr, III, Thm. 3.1] that there 
are R-points €* of X* that lift the k-points €; of X = Xj. The €} have 
pairwise disjoint support, since the points €; are distinct and since FR is a 
complete local ring. Let B” be the union of the loci of the £7, and write 
m= X= BU HU" SB", Va xe Wy and VY SV xe UU". 

Let K be the fraction field of R, and let K be the algebraic closure 
of K. Let X° = X* xp K: Bo = B* xp K; U°? = X°— B°. and V° = 
V* xy- U°. Similarly, write B’’ = BY xp K, U’ = U° — BY”; and V? = 
ve xgoU". Thus V° > U° and V”? > U" are G-Galois étale covers which 
are connected (since Y° is geometrically irreducible). Also, B’° consists 
of r distinct K-points ea. oat £9 specializing respectively to the k-points 
€1,...,€, — viz. é°9 Kon xp K. By Hensel’s Lemma, the compatible system 
of roots of unity {¢,,} in k lifts uniquely to such a system in A, and so in 
K. With respect to this system, we may consider the canonical generators 
of inertia of covers defined over K or K. 

By Case A over the characteristic 0 field K, there is a subgroup fo CT 
containing nj,...,n, such that NI) =I, together with a smooth connected 
l'o-Galois cover W° — U°® of K-curves that dominates V° — U° and whose 
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canonical generators of inertia at points @° over €° are equal to n,;. Let 
No = NNTo, so that [o/No & G. For some subfield K CK that is finite 
over K, this cover descends to a connected I'o-Galois cover of K-curves With 
the corresponding properties (and so in particular it dominates the induceg 
G-Galois cover V° — U° of K-curves). Replacing K by K, and R by its 
integral closure in K, we may assume that there is a smooth connected 
T'o-Galois cover W° — U® that dominates V° — U® and is étale away from 
the general fibre B’° of B’; and that there is a K-point w? on W°® over é° 
at which the canonical generator of inertia is n;. 

Let W* be the normalization of U* in W°, and let W* = W* xp. U™ 
Thus W* — U™ is [o-Galois and W*/No = V™ as G-Galois covers of [/*. 


7 iv yi* 


and similarly for } — U'. Moreover W™ — U” is étale except possibly 
on the closed fibre. Since p does not divide the order of No C N, any 
ramification of W' — V”™ along the closed fibre is tame. So applying 
Abhyankar’s Lemma, and after replacing KA by a finite separable extension 
(and R by its normalization in this extension), we may assume that W’" 
V™ is étale along the general point of the closed fibre. Since V™ — U’* 
is étale, it follows that the ['9-Galois cover W’* — U"™ is étale along the 
general point of the closed fibre, as well as away from the closed fibre. But 
U'™ is regular and W” is normal. So Purity of Branch Locus implies that 
Ww’ — U”™ is étale. Hence the closed fibre Wey — Uz of W* — U™ is étale 
over U', 

Let w? be the closure of w? in W™ and let w; be the closed point of w?. 
Thus w? is an R-point of W* over €*, and w; is a k-point of Wi over €. 


Claim: The k-curve IVF is smooth at the point w;, and the inertia group /; 


there is cyclic with canonical generator n;. 


To prove the claim, we first apply [GM, Corollary 2.3.6] over the com- 
plete local ring of X* at €£; € X C X™*. In our situation, that result says 
that the restriction of the tame J;-Galois cover Spec Ow«., — Spec Ov ¢, 
over its closed fibre is also tame (in the sense of [GM, Def. 2.2.2]). In par- 
ticular, this restriction Spec Ows 4, — Spec Ou ¢, 1s normal, and the inertia 
group J; is acyclic group. Thus the k-curve WY is regular at w;, and hence 
smooth there (since k is perfect). Moreover J; is abelian, and n; € J; (sincé 
n,; is in the inertia group of W° at w°), so (n;) is a normal subgroup of Ji 
The intermediate cover Spec Ow+u,/(ni) — Spec Ou- ¢, is then a normal 
T;/(n;,)-Galois cover which is unramified except at the closed point (where 
it is totally ramified). By Purity of Branch Locus it follows that this in- 
termediate cover is trivial, and so J; = (n;). That is, the inertia group of 
W* — U* at w; is generated by n;. (This conclusion can also be reached 
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, reasoning as in the proof of [Fu, Theorem 3.3, Case 1], instead of using 


M].) Se Ow-«, 1s of the form Oy ¢, [pee |, where f; is a local uniformizer 


a 

ng &), where m; is the order of n;. Since n; 1s the canonical generator 

W° at w?, it follows that the generator n; € J; acts on the overring by 

= fe pee, Restricting to the closed fibre, we find that n; is the 
4 


canonical generator of W, — U at w;. This proves the claim. 


Now W; — U is Galois, étale over U! = U — {&,...,€,}, and smooth 
at the point w; over €;. So W; is smooth. Hence there is a unique irreducible 
component W of W; that contains w;. Let I’ C To be the decomposition 
group of the generic point of W. Thus W — U is a I’-Galois smooth 
connected cover that is branched only at {€),...,&,}. Also, by the claim, 
the canonical generator of inertia at any point of W; over &; is conjugate 
to nj; in To; and in particular this is true for the points of W over &;. 
Moreover the inertia group at w; € W has canonical generator n,, and 
so we have that nj € I’. And since V is irreducible, the composition 
W—- W, — Wy/No & V (which is a morphism of covers of U) is surjective 
on points. Thus W/(N) MT’) & V as G-Galois covers of U. This implies 
that NoI’/No & T'’/(NoNT’) & G &To/No; but No, I’ CT 9. So Nol’ = To 
and thus NI’ = NI'y =I. Thus W has the desired properties. 


(b): This is automatic: If r = 0 then S is empty; and if r = 1 then 
S = {n,}, and n, € I” by (a). 


(c): The same proof works as in Case A(c). O 


Reinterpreting the above result in terms of embedding problems, we 
obtain: 


Corollary 4.2. Let X beasmooth connected projective k-curve, let U C X 
be a dense affine open subset, and let Il = 7,(U). Let E = (a: 11 — G, 
oT G) be a finite embedding problem for II, such that p does not divide 
the order of N = ker(f). Let U' C U such that U — U’ has cardinality 
r>rkp(N). Ifp=0 orr <1, then the induced embedding problem €' for 
I’ = 7,(U") has a proper solution. 


Proof. The surjection a : II+G corresponds to a connected G-Galois étale 
Cover V — U. By Proposition 4.1, there is a smooth connected [-Galois 
Cover W — U branched only at the r points of B’ = U — U’, and which 
dominates V — U. Its restriction W’ — U' is étale, and dominates the 
Testriction V’ — U’. Hence it corresponds to a proper solution to the 
Induced embedding problem &! for I’ = m,(U"), O 


Remarks. (a) It would be desirable to extend Proposition 4.1(c) to the 
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case that r > 1 with p > 0. While the main theorem of the paper (Theorem, 
5.4) does give a connected I-Galois cover W — U that dominates a Ziven 
G-Galois étale cover V — U and has r additional branch points (even if 
r > 1), it does not allow control over the positions of those branch Points 
(except for the first). 

The difficulty in extending the proof of 4.1(c) tor > 1, p > 0 is this: In 
the proof of Case B of 4.1, if S is a relative generating set for N in I’, then the 
cover W* — U™ is irreducible and [-Galois (using (c) in the characteristj, 
0 version); but it is unclear whether its closed fibre W{ is also irreducible 
ifr > 1. It would suffice to show that Z; — X is unibranched at its wildly 
ramified points, where Z; is the closed fibre of the normalization Z* of ¥* 
in W*. One approach to this would be to use Purity of Branch Locus over 
Y*; but for this, one wants Y* to be regular. This raises the question of 
whether Garuti’s result [Ga, Thm. 2] can be strengthened to show that Y* 
can always be chosen to be regular. 


(b) The proofs of the above results require the base field k to be alge- 
braically closed, because of the use of [Ga, Thm. 2] in Proposition 4.1. 


Section 5. The main result. 


This section contains the main result of this paper, that a given I'/N- 
Galois étale cover V —+ U of an affine k-curve is dominated by a [-Galois 
cover of U that is branched at rkp(N) points of U, where T,N are the 
reductions of [,N modulo p(N). In the case that the order of N is prime 
to p (so that f =F and N = N), we prove this essentially by combining the 
special cases in which it has already been shown: the case that the smooth 
completion of V — U has a tamely ramified branch point (cf. §3), and the 
case where rkp(N) < 1 (cf. §4). This is done in Proposition 5.1. Afterwards, 
this result is combined with a result of F. Pop (cf. Thm. 5.2 and Cor. 5.3 
below) to prove the general case (Theorem 5.4). This is then interpreted in 
terms of embedding problems (Corollary 5.5). 


Proposition 5.1. Let [ be a finite group, let N be a normal subgroup 
whose order is not divisible by p, and let G=T/N. Let {n1,...,n-} CN 
be a relative generating set for N in T, with r > 0. Let V =~ U bea 
G-Galois etale cover of smooth connected affine k-curves. Then there is a 
smooth connected [-Galois cover W — U branched only at r distinct points 
£1,...,€» €U, such that W/N is isomorphic to V as a G-Galois cover of U, 
and n; is a canonical generator of inertia over €; for 1 < 1 < r. Moreover 


we may specify the position of €; in advance, if r > 0. 
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: If r < 1 then this assertion is contained in the statement of Propo- 
sy 4.1. So we may assume r > 1. 

Let £1 € U be any point. We may apply Proposition 4.1 with S = {n,}. 
x so yields a subgroup G CT and a smooth connected G-Galois cover 
_,U branched only at €;, such that T = NG andn eG. V /(N NG) is 
smorphic to V as a G-Galois cover of U; and n, is the canonical generator 
an inertia group over €). 

: Let X be the smooth completion of U, let U’ = U — {&}, let ve 
be the restriction of V to U’, and let Y be the normalization of X in V’ 
ior equivalently, in V). Thus Y — X is a G-Galois cover that is tamely 
‘ramified over the point €; € B’ := X —U’. Moreover nj, is the canonical 

generator of some inertia group over €;, and this inertia group normalizes N 
since N is normal. In addition, f = NG, and so the relative generating set 
{n,.- Mr} for N is in particular a supplementary generating set for P with 
respect to G. Since n; € G, it follows that {no,...,n,} is a supplementary 
generating set for T with respect to G: 

So we may apply Proposition 3.1 (with G,N,r—1,V! =6U ppg... . 2, 
playing the roles of G,E,r,V — U,e,,...,e, there). As a result, we obtain 
a smooth connected [-Galois cover W' — U’ having at most r — 1 branch 
points £.,...,€ € U’, satisfying the three conditions (1i)-(1i1) there. That 
is, the G-Galois cover W'/N — U’ agrees with V’/(N MG) — U’; there are 
inertia groups of W’ — U’ over the branch points €9,...,€ having canonical 
generators no,...,n, respectively; and the inertia groups of Y — X, over 
each point of B’—{,} = X —U, are also inertia groups of Z — X over that 
point, where Z is the smooth completion of W’. Let W be the normalization 
of U in W’. Thus W — U is a smooth connected T-Galois cover branched 
only at €,,...,€,, with n; a canonical generator of inertia over &; for each 
t>1. As a G-Galois cover of U, the intermediate cover W/N is isomorphic 
to V/(N 1G) and hence to V. O 


Remarks. (a) Unlike Proposition 4.1, the above result yields a connected 
l’-Galois cover over the given G-Galois cover even if there are two or more 
elements in the relative generating set, in characteristic p > 0. But on the 
Other hand, in 5.1 we lose control of the positions of the branch points other 
than €,, and of the inertia group over €,. Similarly, unlike the results of §3, 
the above result does not require that the smooth completion Y — X of 
V —U have a tamely ramified branch point. But on the other hand, in 5.1 
We have less control over inertia groups, and do not have a version that is 
analogous to 3.1 using the smaller number rk(#,G) of new branch points. 

(b) In the proof of the above result, it is tempting to try to invoke 
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Proposition 4.1 repeatedly on successive n;’s, rather than to use Propositi, 

3.1 (and thus to control the positions of all of the branch points, thong, 
losing control over the inertia groups). The strategy would be to take the 
subgroup G C TI containing n,; and surjecting onto G, given by the Use 
of Prop. 4.1 as above; to take the minimal G-invariant subgroup \V; ¢ N 
that contains ng; and then to form the semi-direct product I’ = NixG With 
respect to the conjugation action of G on N, inT. Applying 4.1 again would 
yield a subgroup G’ C I’ that contains (ng, 1) and surjects onto G under the 
second projection; and then one could take the image of G’ in T under the 
multiplication homomorphism yp : fT’ —T given by (n,g) H ng. This iMage | 
u(G") CT would then contain ng and surject onto G, and one might hope to 


repeat the process. But unfortunately, u(G) need no longer contain m1. Foy _ 
example, suppose that we are given p > 3; N = S3; G = (g), cyclic of orde | 
p, and [ = N x G: with n; = (12) and nz = (18) being (relative) generators | 
of N. Then we could have G = ((12),g) C T (viewing N,G as subgroups of | 
Tr); Ni = N; 1’ = NxG; G = (((13), 1), (12), (12)), (1,9) CNG =P; | 
and u(G") = ((13),g) C IT, which is strictly smaller than T = N x G and | 
does not contain the first generator n; = (12) of N. 


Finally, we combine the above proposition with a result of F. Pop, to 
obtain our main theorem (Theorem 5.4 below). In this theorem, unlike 
the previous results in this paper, we permit the order of the kernel N 
to be divisible by p. The following is a rephrasing of Pop’s result [Pol, 
Theorem B]: 


Theorem 5.2. [Pol, Thm. B] Let fT = QxG where Q is a quasi-p group; 
let ¥ — X be a smooth connected G-Galois cover of k-curves; and let € € X 
(possibly a branch point of Y — X). Then there is a smooth connected 
['-Galois cover Z — X dominating Y — X, such that Z — Y is branched 
only at points of Y over €, and the inertia groups of Z — Y over those 
points are the Sylow p-subgroups of Q. 


Although this result is stated just for the split case, it implies a result 
in the more general case of group extensions by a quasi-p kernel. Namely, as 
the following corollary states, if / is a smooth connected affine k-curve and 
E = (a, f) is a finite embedding problem for 7)(U’) with ker(f) a quasi-P 
group, then € has a proper solution. 


Corollary 5.3. Let Q be a normal quasi-p subgroup of a finite group T, and 
let G=T/Q. Let V =U be a connected G-Galois étale cover of smooth 
affine k-curves. Then there is a connected [-Galois étale cover W — U 
dominating V — U. 
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bof. By [Se2, Prop. 1], the group II = 7;(U’) has cohomological dimen- 
n 1 and hence is a projective group. The G-Galois étale cover V — U 
responds to asurjection a : If — G, and there is a natural quotient map 
ipa G. Thus é = (f :f — G,a: I — G) is an embedding problem for 
e projective group II, and so it has a weak solution ag : I — TI, say with 
ge Go. Let Po = QxGo, where the semi-direct product is taken with 
espect to the conjugation action of Go on Q<T, and let fo : [9 ~Gp be 
the natural quotient map. 
‘i The surjection ag : Il — Go corresponds to a connected Go-Galois 
atale cover Vo — U, say with smooth completion Yg — X. Theorem 5.2 
now applies to the group [9, the Go-Galois cover Yo — X, and a point 
€€ B:= X —U. So there is a smooth connected Po-Galois cover Zg — X 
dominating Yo — X such that Zp — Yo is branched only at points over €. 
Thus Zp — X is étale over U, and corresponds to a surjection 8p : II -~To. 
Now fo = ao since Zp — X dominates Yo — X. So {po is a proper 
solution to the split embedding problem & = (a9: II — Go, fo: To — Go). 
Thus by Proposition 2.1, the original embedding problem € = (f :T > G, 
a : Ii — G) has a proper solution § : I] + T. Here ff = a. The map 
B corresponds to a connected [-Galois étale cover W — U, and this cover 
dominates V — U because f3 = a. oO 


Combining the above with Proposition 5.1 yields the main theorem of 
the paper: 


Theorem 5.4. Let N be a normal subgroup of a finite group I, and let 
G=T/N. Let N = N/p(N) andl = T/p(N), and let r = rkp(N). Let 
V — U be a G-Galois étale cover of smooth connected affine k-curves. 
Then there is a smooth connected T-Galois cover W — U branched only 
at r distinct points €,,...,&- € U, such that W/N is isomorphic to V as a 
G-Galois cover of U. Moreover we may specify the position of £ in advance, 
ifr > 0. 


Proof. First observe here that p(N) is a characteristic subgroup of N, and 
So is a normal subgroup of I. Hence T = I'/p(N) is well defined. 

Pick €; € U. Since r = rkp(N), there is a relative generating set 
{f1,...,2.} C N for N inT. Also, P/N = (T/p(N))/(N/p(N)) & C/N = 
G. So by Proposition 5.1, there are distinct points €,...,€ € U — {&,} 
together with a smooth connected T-Galois cover W — U branched only 
at S = {€,...,&} such that W/N is isomorphic to V as a G-Galois cover 
of U. Let U' = U —S and let W’ be the inverse image of U’ in W. We 
May now apply Corollary 5.3 with Q = p(V), and with W' — U' playing 
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the role of V — U there. As a result, we obtain a connected I-Galois étale 
cover W’ — U' dominating W! — U’. The normalization W = U of y = 
W’ — U’' is then as desired. . 


Remark. In the situation of Theorem 5.4 above, if the short exact Sac 
quence 1 — p(N) — [ + T — 1 is split, then in the proof we can apply 
Pop’s Theorem (5.2 above) rather than Corollary 5.3. Doing so gives More 
information about inertia. In particular, suppose we are given an Integer 
r > rkp(V) and a relative generating set {n),...,n-} C N for N in r. 
Then in the split situation we may choose the [-Galois cover W — 1) So 
that the branching over é9,...,€ is tame, and so that n; is the canonical] 
generator of inertia at some point over €; for each 7 > 1. 


Reinterpreting the above result in terms of embedding problems, we 
immediately obtain: 


Corollary 5.5. Let UV be a smooth connected affine k-curve, let Tl = 7(U), 
and let E = (a: I] — G,f : T — G) be a finite embedding problem for 
II. Let N = ker(f) and let r = rkp(N). Then for some set S C U of 
cardinality r, the induced embedding problem €' for TI! = 1(U — S) has 
a proper solution. Moreover, if r > 0, then one of the points of S can be 


chosen in advance. 
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Introduction 


The absolute Galois group Gg over the rational number field Q can be con- 
sidered as a subgroup of the Grothendieck-Teichmuller group GT which acts 
faithfully on the free profinite group Fy of rank 2 (the algebraic fundamental 
group of the projective line minus 3 points). A key property characterizing 
GT as a subgroup of the automorphism group Aut F) of F» is the so-called 
5-cycle relation. It is not known whether Gg = GT. 

In this article, we shall study the action of GT on the double commutator 
quotient F2/F3 of Fy. In particular, we shall show that the “beta function” 
B, (o € GT), which together with the “cyclotomic character” A, describes 
this action completely, enjoys the “I,-factorization property” as in the case 
of o € Gg proved earlier by G. W. Anderson. The key to our proof is the 
d-cycle relation for GT. We shall then discuss which of the main known 
properties of [, for o € Gg can possibly be generalized to the case of 
o € GT. As basic tools, we shall use profinite free differential calculus and 
profinite Blanchfield-Lyndon theorem, stated with proofs in the Appendix, 
which may be of independent interest. 

Although the first main property of B,(o € Gg) is thus shared by 
B,(o € GT), the author suspects that Gg € GT, and also that Ander- 
son’s multiplication formula for [g(¢ € Gg), which is arithmetic in nature, 
does not hold if o is an arbitrary element of GT (whose definition is ba- 
sically geometric and the key “5-cycle relation” seems to be “used up” to 
prove the I',-factorizability of B,). 

The definitions and main results are stated in §1. They are “general- 
izations” of Anderson’s, [Ao], from the case of Gg to GT. First, we recall 
the definition of the group GT and define for each o € GT the associ 
ated adelic beta function B,, which is a function on (Q/Z)? with values in 
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@ Qas)* (Qas is the maximal abelian extension of Q). Then we state 
me main properties of B, (§1.6 (Theorem 1, etc.)). As a consequence, B, 


gctors as 


= P,(s1)05(s2) 
Ba(si;82) = T,(s1 + 52) 


with some (W@Qaq,)*-valued function [,(s) on Q/Z, where W = II, W (Ep) 
(the product of the ring of Witt vectors). This defines the hyperadelic 
amma function I, (o € GT), modulo some elementary factors. We shall 
{hen examine which of the main known properties of T,(s) for o € Gg 
still holds when o € GT. One of them is Anderson’s formula [A.](i) Th.5, 
connecting Dlog [, (o € Gg) with the Kummer 1-cocycles associated with 
certain cyclotomic elements. We shall show that this remains valid for o € 
GT in an appropriate sense (Theorem 2, §1.8). Another is his multiplication 
formula [A2](i)Cor. 8.6.3 for T,(¢ € Gg): 


(s1,82 € Q/Z), (1) 


(T] P(6))( [] Pe(e))-'Po(a)7? = 1@ (n-))271 (2) 


nb=a ne=0 


(a € Q/Z,n EN), which has its origin in the Gauss multiplication formula 
for the classical gamma function. This appears to the author rather as a 
key property which possibly distinguishes [, for o € Gg from those for 
go € GT. (See §1.8 for the definition of “(n~{*))?-!” for ¢ € GT.) We 
note that the above formula (2), related to the Gg-action on the double 
commutator quotient of the fundamental group of P’ — {0, 1, co}, is inviszble 
from the Gg-action on its quotient modulo any fixed member of the lower 
central series (see §1.9). 

In §2, we shall give proofs of statements given in §1. As for the proof of 
Theorem 1, instead of using the 5-cycle relation in its bare form, we shall 
use the essentially equivalent “active” form, the extendability of the action 
of GT on the fundamental group of P! — {0,1,00} to that on the sphere 
braid group on 5 strings. 

The final section §3 is for the Appendix. 


1 Statement of main results and discussions 


1.1 


First, we shall fix notations related to free groups and braid groups. 
F,: the (abstract) free group of rank r (r > 1), 
B,: the Artin braid group on n strings (n > 2) 
= (b),--+,bn-1): the standard generators, 


1 z ~+1n 


b;: > an vy a ie 
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P,: the pure Artin braid group on n strings (P, 4 Bn) 
= (pi; )igicjen, Pig = (07-1 ++ Opgr be (bpea ~~ - b:4.,)74 
J; 


a J n 
Pij: sees Go ae ree Pji = Pij- 


The composition of braids is from the right 6/b = 6'0b (b,b’ € Be). Define 
the elements p; (2 <i <n) of P, by 


ne Pi = PiurP2i +++ Pi-14. 


Note that the p;’s are mutually commutative. The (27)-rotation w, = 
P2--:Pn generates the center (~ Z) of B,. Consider the quotient groups Be: 
Pe of Bn, Py defined by the extra relations “pn = wn = 1”, and for eath 
b € By, denote its image on B* by 6”. The relation “pp = 1” corresponds 
to adding the point at enim and “wy, = 1” corresponds to dividing by 
the (27)-rotation. The group P* is the fundamental group of 


((Pg)" — A)/PGL2(C), 
where Pi is the complex projective line and A is the hyperdiagonal 


A = {(21,-+-,2n) € (PG)";2i = 2; forsome i# j}. 


IZ 


Now let us recall the definition of the Grothendieck-Teichmiller group GT. 
In what follows, for any discrete group I’, I will denote its profinite comple- 
tion. The groups [, for which we shall consider [ in this article, are always 
residually finite, i.e., the canonical homomorphism T — I is injective. We 
shall use the same symbol for an element of I and its image in. Let G be 
any topological group. Then its abelianized quotient (resp. its commutator 
subgroup) will be denoted by G* (resp. G’), so that G2? = G/G’. The 
double commutator subgroup of G is G” = (G’)’. By Aut G, we shall mean 
the full automorphism group of the topological group G. 

Now let F> be a free group of rank 2, with a given set of free generators 
z,y. Then, by definition ([D], {I3]; see also [I-M] A.2), 


[ »| 3 €Z%,7f € Fy,8.t. o(e) = 2%, o(y) = foly? 
— ) 29 92h 4 bast ) y) a f y f : 
GT =<o0€ Aut Fy and A,f satisfy (2)(3)(4) below 

(1) 


pa Cen ghaqere ss em (2-cycle relation), (2) 
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Biz, z)2" f(y, z)y"f(2,y)e" =1, if syz=1,m= 3(\- J), 
(3-cycle relation), (3) 


f (x12, 23) f(v34, 245) f (251, 212) f(223, 234) f(eas, 231) =1 
in fe 


3, Where zi; = pj; (5-cycle relation). (4) 


ere, for a profinite group G and a,f € G, f(a, @) will denote the image 
f feé F. under the unique homomorphism Fy — G that maps zx to a and 
to f. 

It is easy to see that 4, f are uniquely determined by o. They will 
sometimes be denoted by A,, fo, respectively. It is known that GT forms 
a subgroup of Aut F2, and that the absolute Galois group Gg = Gal(Q/Q) 


of Q (Q: the rational number field, Q: its algebraic closure in C) can be 
embedded into GT through its action on the algebraic fundamental group 


F, = m(P§ — {0, 1,00}, 01) = (z,y), (5) 


x Y 


(cf (D][I4]). The homomorphism GT — Z™* defined by ¢ — A, coin- 
cides with the cyclotomic character x on Gg. We also recall that each 
o € GT maps z = (zy)! (the loop around 00) to g7!z*g,,. where gg = 
fo(2, z)e80-) (ef [Ig)) 


7.3 


In general, for any profinite group G, Z((G]] will denote the completed group 
algebra of G. It is a profinite ring obtained as the projective limit of all 
finite group algebras (Z/n)[G/N], where n (resp.N) runs over the positive 
integers (resp. open normal subgroups of G). When G is free profinite, or 
free abelian profinite, we write: 


A, = “2[F-]], (1) 
A, = 2Z2Z]] (r>1). (2) 
In practice, F, will always be given together with its free generators 21,-°-, 


t,, and Z’ will appear as its abelianization F2°, The image of zr; on F2 will 
usually be denoted by the boldface x; (1 <i <r), and Z” (though written 
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additively) will be considered as a multiplicative group, which could better 
be expressed as 


_ = 


(3) 


The image of 4 € A, under the projection 


wv, : A, — A, (4) 
is usually denoted by A%° (except that 29° are abbreviated as x,). In this 
article, the two cases r = 2,3 appear frequently. 


1.4 


Now the group GT acts on F2 = (x,y) (by its very definition), ane hence also 
on its characteristic subgroups FL FS Fs’ and their quotients (F328 te il 
etc. Our main aim is to study its aetion on (F%4)2?, 

First, since Ps = Fo/F! is a free abelian profinite group of rank 2 
generated by the images x,y of z,y, we shall identify Pe with Z? via its 
basis x,y, and write A» = Z((F 3") The action of Fab on (Fees induced 
from the conjugation (n — fnf—! (n € Fi, f € Fo)) equips (F4)*° with a 
structure of A»o-module. 


Proposition 1.4.1 (Fa) is a free Ag-module of rank 1 generated by (z, y), 
; tt 


the class mod Fy of (z,y) = xyxr~*y 

The pro-@ version of this statement was proved in {I,], and the profinite 
version stated without proof in [I3]. We sal provide its proof in §2.1. 

Now let us study the GT-actions on F2> and (F4)%. First, ¢ € GT 
acts on the abelian group pe as the \,-th power automorphism, Beeuise it 
maps the generators x,y to their A,-th power. Extend this to a continuous 
ring automorphism action of ¢ on Ay = Z{[F2°]] in the unique way. The 
natural action of ¢ on the A-module (F4)?° is semi-linear, and is expressed 
by a single element B! of A¥ defined by 


o(z,y) = By -(z,y) (1) 


x*? — ] yore d 


alin ‘By. 2 
x-—l y-—l (2) 


This makes sense because x — 1, y — 1 are not zero-divisors of A», and the 
two factors in front of B, are units of Az (Proposition 2.1.1 below). We 
shall see later ($2.2) that B, can be expressed as the abelianization of a 
certain derivative of f,. 
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w. Anderson has shown [Ao] that it is theoretically useful to embed the 
mpleted group algebra A, = = Z|[Z"]] of Z” (r > 1) into the ring 


Fet((Q/Z)",Z @ Qas) (1) 


. (Z@ Qa» )-valued functions on (Q/Z)", where Qas denotes the maximal 
pelian extension field over Q in C. His embedding 


A, © Fet((Q/Z)",Z@ Qap) 
W W (2) 
Ee B 


js defined as follows. Take any (s,,--:,s-) € (Q/Z)’, and choose any pos- 
itive integer N such that Ns; = 0 for all 7. Express the image of 6 on 


2[(Z/N)"| as 
b= > ca-a, (3) 


a 


where a = (q@),---,a,) runs over (Z/N)", and cg € Z. Define the function 
B by the formula 


Bl sige? 43h)-S S: Ca @ exp(—24V—1(a151 +--+ a,S,)). (4) 


Then 6 — B is an injective ring homomorphism. 


For example, the element (0,---,0, iH 0,---,O)€ Z corresponds to the 
function B(s,,---,8,) = exp(—27/—1s;). 

By means of the embedding (2) for r = 2, we shall regard B, (o € GT) 
asa (Z@Qqz)-valued function on (Q/Z)?, and call it the adelic beta function 
(or simply, the beta function) associated with a. Since Bg is invertible in 
Ag, it is (Z@ Qas)*-valued. Moreover, it follows directly from the definitions 
that 

Bor(s1, 82) = By(81, $2)Br(X951, A082) (5) 


(o,ré GT). Moreover, when o € Gg, Bg(s1, 52) coincides with the adelic 
beta function defined and studied by Anderson [Ag] (which generalizes the 
f-adic power series studied in [I,]) (see [A] (ii) §5.6). We shall study some 
basic properties of B,(s1, 2) for o € GT. 


Remark The minus sign in the exponential in (4) is for adjusting our 
definition of B, (¢ € Gg) with Anderson’s. His generator of Z(1)? arising 
from that of Z(1) called ¢.. corresponds to our generator x~',y~* of jaue 
His association in [A2](i)§13.1, 2 — (1t.u)")(14 v)s Should iz Eola 
by z— (1+ u)7"@(1 + v)7 3), 
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1.6 


First, we shall list some properties of B,(o € GT) that can be Proved 
without using the 5-cycle relation, in the following 


Proposition 1.6.1 Leto € GT. Then 
G) Be(si,0) = B,(0,s2) =1@1, 


ims —ITS 
= € 


(i) Bo(s,-s) =A, @ = (@=V-1), 


INg _ pwmihing 


(iil) Bo(si,s2) = Bo(se, 51), 
(iv) Bj(s1,$2)Bs(s3,—s3) is S3-symmetric on 5; +59 +53 = 0. 


A deeper result which uses the 5-cycle relation is: 


Theorem 1 [fo € GT, then 
B,(s1,$2)Bo(s1 + $2, $3) (1) 


is S3-symmetric in s1,89,83. Under (iv) of Proposition 1.6.1, this is also 
equivalent to saying that 


B,(s1, $2)Bo(s3, $4) Bo(s1 + $9,583 + $4) (2) 
1s Sq-symmetric on s; +$9+53+ 54 = 0. 


The proofs will be given in §2. 


5 FY 


In [A], G. Anderson has shown that his adelic beta function B,(a¢ € Gg) 
has a factorization, which is analogous to the factorization of the classical 
beta function in terms of the classical gamma function, and studied some 
fundamental properties of its “factor”, the hyperadelic gamma function Ig: 
We are now going to examine which of these properties can be generalized 
to the case where ¢ € GT. 

Let W = kis W, be the product over all primes p of the ring Wp = 
W(F,) of Witt vectors over F, (an algebraic closure of F, = Z/p). It 
is linearly topologized by taking {nW}nen as the fundamental system of 
neighborhoods of 0. For each r > 1, we consider the completed group 
algebra ; 

A, = W{[Z"]] = lim, y((W/nW)(Z/N)")), (1) 
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4 embed this into the ring 
Fet((Q/Z)", W @ Qas) 


(W @ Qap)-valued functions on (Q/Z)’, exactly in the same way as in the 
ace of Ar described in §1.5 (cf. [A>] (i) for more details). The following 
iy observation is due to Anderson ([Ag2] (ii) §2.8; Theorem 6). (It is a 
; ild) generalization of the well-known fact that, over W,, there exist no 
jon-trivial extensions of the formal completion of the multiplicative group 


Gm bY itself.) 


Yemma (G. Anderson) Let B be a unit of the ring Ay = W[[Z?]] satisfying 
(i) B(si, 82) = B(s2, 51) (81,52 € Q/Z), 
(ii) B(s1, $2)B(s1 + 52,53) = B(s1,$2+83)B(s2,83) (81,52, 53 € Q/Z). 
Then there exists a unit T of the ring Ay = W[[Z]] such that 


['(s1)P'(s2) 


Bhais83)— P(s) + 52) 


(2) 
holds for all s,,s2 € Q/Z. 


For each B, the element IT is determined only modulo multiples of ele- 
ments of the group 


Ee={FE Ax. E(s, +s) = E(s;)E(s2) Vs1,s2 € Q/Z}. (3) 


Anderson has also shown ([As] (i) Proposition 7.3.1) that € consists of 
exactly those elements of Af (a priori AX, but in fact Af‘) of the form 
# €pX1", where x, is as in §1.3, p runs over all primes, ep € Z (the coef- 
ficient ring) is the idempotent associated with the projection — Zp, and 
(cp)p is an arbitrary element of Il, Z. (Note that this series converges in 
Ar.) 

Now let o be any element of GT, and B, € AX be the associated beta 
function. Then B = B, satisfies (i)(ii) of the above lemma, by Proposi- 
tion 1.6.1(iii) and Theorem 1. Therefore, there exists [, € A*® satisfying 


re I; $9 
B,(s1,52) = Sea (s1,52 € Q/Z), (2), 


and I, is unique modulo € (multiplicatively). Each I’, satisfying (2), will 
be called a branch of the hyperadelic gamma function (or simply, the gamma 
function) associated with ¢. When o € Gg, this corresponds to Anderson’s 
“branch of the hyperadelic gamma function”. It is easy to see that if 0,7 € 


GT, and [,,[f'; are some branches associated with o,r, then T,7(s) = 
D,(s)l',(Aos) is a branch associated with or. Since the elements of € are 
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group homomorphisms from Q/Z into (Z@ Qas)*, it is clear that for an 
$1,:+:, 5k € Q/Z and n,---, ny € Z satisfying : 


k 
s nis; — 0 (in Q/Z), (4) 


the value of the product 


k 
[[ v.(s:) E(W@ Qu)” (5), 
i=l 
is independent of the choice of a branch T',. For which s1,---,s% ang 
M1,°++,N~ can we compute the value of (5),? The first obvious examples 


are given in 


Proposition 1.7.1 Leto € GT, andT, be a branch of the gamma function 
associated with o. Then 


(i) P,(0) = 1@ Ls 
(ii) To(s)P'.(—s) =». ® 


ei™s _ p-its 


Hien gaees © (¢ CRIES V=1) 
Proof. These follow immediately from Proposition 1.6.1 (1)(i1) (respec- 
tively). 
C) 
For each s € Q/Z, (s) will denote its representative in Q in the interval 
(0,1). When s,---,s~ € Q/Z and nj,---,ng € Z are such that s;.---,5, F 
Q, and moreover that the sum 


w= >: ni (As;) (A € Z*) (6) 
1=1 
is independent of A and belongs to Z, Anderson has connected the value of 
(5)> with that of 


k 
(QrV—1)-” [ [T((si))™, (5)c 
1=1 
where [ is the classical gamma function. In fact, then the value of (5)c 1s 
known to be algebraic (N. Koblitz and A. Ogus), and by using Deligne’s 
theory of absolute Hodge cycles, he proved a beautiful formula connecting 
(je with “(G)e. te, 


Theorem (Anderson [A2](i) Theorem 2) Under the above assumptions 0” 
$1,°°+, Sp € Q/Z and nj,---,ng, one has 


— 
1 


k k 
[] Po(si)" = x(o)* @ {(2rv=1)-” FT Psi) }°73, 


t= 1 i=l 
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jp any © © Gg and any branch Tz of the gamma function associated with 
Here, x(a) ts the cyclotomic character. 


a direct application, using the Gauss multiplication formula 


n—-1 . n-1 
(Ty 24 J] r2))tr(s)? = n'* (ec 
j=0 j=! 


(nEN,se C), he proved 
Corollary ([A2](i) Corollary 8.6.3) Leto € Gg and Tg be any branch of 


the gamma function associated with o. Let n be any natural number, and 


a€Q/Z. Then 


(TT Po(6))( TT To(e))“Po(a)-t = 1@ (n= (9) )2-2. (8)o 


nb=a ne=0 


Remark As for the value of ]],,._. I'(c), it can be computed either from 
Proposition 1.7.1, or by combining Anderson’s connection formula (7) with 


the classical forrauila 
no 


I] t(4) = (27) n8. Or 
1 

Note that the “sign” of (1/2) depends on the choice of a branch T,. The 

result reads as follows. 


TI] Tole) =x(o)*> @((V=1) 


ne=0 


“3 /n)?-}..-n codd, 
| e#1/2 £ x(0)37} Q ((J/—1)!- 7 Van)?-} ---n teven, (9)o 
.  Po(3)? = x(¢)@(V=T) 


Suppose now that oc is an arbitrary element of GT. Then since we still 
have I, for such o, the left hand side of (8), is defined, which is an element 
of (We Qas)* depending only on o. And as we shall see in $1.8, it is easy 
to define an object corresponding to (n~{*))?-! for any o € GT, which 
is again a root of unity (whose order dividing the denominator of a). A 
basic question is whether (8), remains to hold for all o € GT. The author 
Suspects that (8), could be a key property that distinguishes Gg from GT. 
We shall come back to this subject in §1.9. 


1.8 


In this section, we shall define, for each n € N, 7€ Z/n and r € Q/Z, a 
Certain l-cocycle «(c) = Ke (oye € GT). When o € Gg and z = 0 (resp. 


Tor IHARA: On beta and gamma functions 


t #0), this is a Kummer 1-cocycle associated with n~” (resp. (1 ~ C~i). 
where ¢, = exp(27/—1/n). We shall also show that a basic resul; 

Anderson (Theorem 5 in [Ag](i) §11.4), which connects DiogT, With the 
system of .(c) when o € Gg, remains valid on GT. This will be States 
as Theorem 2, whose proof will be given in §2.6. (This gives an alternatiy, 
proof for the case o € Gg.) . 

We shall give two equivalent definitions of Kk, }(o), of which the first ; 

geometric and the second, group theoretic. Let o € GT. The first definitio, 
is based on the interpretation of f, € Fj given in [I4](§1). Let Af be the 
maximal Galois extension of Q(t) unramified outside t = 0,1, oo, embeddeg 
in the field Q{{t}} of all formal Puiseux series in t. We shall identify 
Fy = (x,y) with the Galois group Gal(M/Q(t)) in the way described there 
Since t}/" € M is abelian over Q(t), and since fs € F4, fz stabilizes each 
element of M of the form 1 — Cz't!/". Moreover, all roots of 1 — Crttln 
are contained in M. Therefore, for each r € Q/Z, there is a unique root of 


unity K(o) = Ki). (c) such that 
Fo((L— Goel" )')) = wh (o) (1 Geen), (1) 


where the (r)-th power of 1 — ¢7't!/" is chosen to be the one with value | 
ab t= 0, 


Proposition 1.8.1 [fo € Gg, then 
Kn (7) = a(n '")) /n~(r) EA Bee 0, 
= o((1 — Gr Xl“) 1¢1 — c= ty tr) id 0. (2) 


Here, for any root of unity ¢ #1, the branch of log(1—C) which determines 
our choice of (1—C)!") is the one whose imaginary part has absolute value 


< 1/2. 


Proof. This follows easily by using the description of the action of f, on M 
given in [I4]§1 (i.e., first, let c~? act on the Puiseux coefficients at t = 0, then 
make an analytic continuation along (0,1), then let o act on the Puiseux 
coefficients at t = 1 in 1 —¢, then continue analytically back to ¢ = 0 on 
0,1)). 
(0,1)) 
Now, to give an equivalent group theoretic definition of Ki) (o), consider 
the kernel H, of the homomorphism F, — Z/n defined by x — 1, y — 0- 
Then H,, is normally generated by y and 2”, and is free (profinite) of rank 
n+ 1 with free generators x~*yr'(0 < i < n) and 2”. As FEC Hy, fo 
belongs to Hy. 
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osition 1.8.2 For anyo € GT andi ce Z/n, Ki. (o) is the rmage of 
nder the unique homomorphism Hn — foo (the group of roots of unity 
5) defined by 

za? — 1, r-tyr' — exp(2xV—I1r), zJyxi — 1(j i). (3) 
of. First, note that H, = Gal(M/Q(t!/")), and that the inertia groups 
ye the places ae = 0,¢,(0 < 3 < n) are generated by some H,- 
jugate of r",r2 4 yz!, respectively. Moreover, 


(c~tyz")(1 = eae Ae = exp(27V—Ir)(1 m! emis Tas oe. ‘ (4) 
r assertion is now obvious. Oo 


roposition 1.8.3 For each o € GT, there erists a unique element V, € 
such that if we write 


We= > W(c)x7* mod (x - 1) (5) 


t(mod n) 


Be jmlo) = Cem) (6) 


meallm=1,2,--:. 


Proof. Since r — Ks) (a), for fixed o,n,i, transforms addition to multipli- 
cation, the formula (6) for all m determines WP (c) € Z. Since the product 
(j) 
is 2 
to (0), as can be checked directly from the definition of Kn -(c), the 


simultaneous congruences (5) have a unique solution WV, € A). O 


of x; (0), for all 3 (mod nd) which project to a fixed i (mod n) is equal 


Now let D : A; — A, be the unique continuous W-linear derivation 
such that D(x~!) = x7}, or equivalently, D(x) = —x (cf. [Aa](i)§11.1). 
This can also be defined as follows. Let x,y be free generators of Z?, let 
A, = W/[[Z"]], and for each A € Aj, denote by A{x} resp. A{xy} its image 
in Ay under the continuous W-algebra embeddings A, — Az defined by 
X— x resp. x + xy. Then D(,) is the image of 


A{xy} — A{x} - 
ne a Team (/) 
y-l 
under the projection Ay — A, defined by y — 1. As in [Ago], for each 

A € AX, we define 
Diog X := AT" D(A). (8) 
The following theorem is an enlargement, from the case ¢ € Gg to 
7 € GT, of Anderson’s Theorem 5 of {As](i) (with different methods of 
Proof), 
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Theorem 2 For any o € GT, 


Dilogl, —(Dlogl,)(0) = W, (9 


holds for any branch Tz of the gamma function associated with o-. 


The proof will be postponed until §2.6, as some results of §2.2 Will } 
used. At any rate, the proof is quite elementary. We only note here thas 
for any F € €,Dlog E € Z (a constant) (cf. [Ao](i) §11.1); hence the left 
hand side of (9) is a priori independent of the choice of a branch of = 
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Anderson’s multiplication formula (8), reflects the existence of a family of 
Gg,,-stable abelian subquotients of F2 whose members are mutually jp. 
dependent as subquotients but are closely related to each other as Gg.,- 
modules. To explain this in a simplified situation, let us fix an odd prime 
number /, and let s,,s2 € Q/Z be such that s;, 52,5, + sy have the same 
exact denominator /” for some n > 1. Let C = C(s1,52) be the curve over 


Q(pie) with function field 
K(s1, 52) = Q(ui)(t, t)(1 — t)'*?)), 


and J = J(s1,s2) be its Jacobian. Let 6* = 8*(s,,52) be the automorphism 
of A'(s1, 52) over Q(z) that leaves t invariant and multiplies ¢{*+) (1 —2)'*2! 
by Cm = exp(27/—1/l"). Let @ = 6(s1, 52) be the automorphism of C 
such that (@"f)(@P) = f(P) holds for all geometric points P of C and 
f € K(s1,8); the induced endomorphism of J will also be denoted by @. 
Denote by A = A(s,, 52) the quotient of J modulo the image of 


i-1 
ya 
1=0 


Then A is an abelian variety with complex multiplication by the ring of 
integers of Q(¢jn) (identified with Q(@) via (jn — 6), and the Tate mod- 
ule T(s1,s2) := Ti(A) is a free Z; ® Z[¢jn]-module of rank 1. We note 
that A(s1, 52) is the primitive part of J(s1,$), and so, as abelian varieties: 
A(s1,s2) and A(ls,,ls,) are (at least in general) unrelated to each other. 
Still, as Gygyy,..)-modules, T(s1, sz) and T(Is;,1s2) are related to each othe! 
as follows. Let ¢ € Gg,,,.). Then o acts on T(s;, s2) as a multiplication of 
some J, = 3,(s1,82) € Z; ® Zin]. The result of [I,] §11, Cor. of Theorem 


4, restated in terms of B,, reads as 


Proposition 1.9.1 The multiplier 3,(s,,59) ts equal to the projection of 
Bo(-si, —s2) on Z; @ Qap. 
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since the conjugates of 8,(s1,52) over Qi(Gia-1) are 35(As,,As2) with 
1 (mod I"-1) counted mod /”, Anderson’s multiplication formula §1.7 
(for n= l,a = 81,52) gives directly the following 


yollary 1.9.2 
Noi(en )/Qr(C,n-1 \(3o(s1, S2)) = Pe ltlsi; Iso) (n > ony 


+ me stress again that this is not a mere compatibility, but is something 
ore surprising. 

The equality of this type is quite analogous to Coleman’s norm relation, 
nd indeed, Coleman himself made significant contributions to the discovery 
nd the use of the above relation (cf. {C][I-k]). 

Now let FL? denote the maximal pro-! quotient of PS. Then, as described 
n (1, }§11, each T(s1, 82) is canonically isomorphic, as CQ(u,.0)-module, to 
a certain quotient of (Foy Rey, and as its quotient, T(s,,s2) and 
P(ls;,{sq) are unrelated. If we truncate Ff,’ by a member of its lower 
entral series, then, only some finite (over F;) quotient of each T(s;, s2) re- 
mains. So, the phenomena as shown in Corollary 1.9.2 cannot be observed 
if we only consider the Galois actions on the Lie algebra over Q; of F4” 
“modulo a fixed member of its lower central series. It is easy to see that each 
quotient T(s,,s2) of (FO) (RO is stable under the action of GT. But 
it is hard to believe that one can prove such a relation as Corollary 1.9.2 
(and hence such a relation as Anderson’s multiplication formula (8),) for 
any element o € GT. A key property defining GT as a subgroup of Auth» 
Is the 5-cycle relation which seems to be already “used up” in proving the 
decomposability of B, in terms of [,. As a closely related matter, it seems 
to be an interesting problem to determine the image of the 1-cocycle 


Gg do—T, € AX /é. (1) 


In the pro-/ case, this study has already been done by Coleman, Ichimura 
and Kaneko (cf. [I-K]). 


1.10 


Some additional remarks. In §1.8, we defined for each n € N,ie€Z/n, ré 
Q/Z and o € GT a certain root of unity Ky) (a). They satisfy 


Ke) (0) = KP (a)e), (0) (r,r’ € Q/Z), (1) 
I] Glo) = xh(e)  (n,deN), (2) 
y( mod nd) 
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(as already noted in the proof of Proposition 1.8.3), and moreover, 


KO(or) = w(a)(nme (nr) (0,7 € GT). (3) 


The last equality follows easily from Proposition 1.8.2, using the obvious 


identity fo; = f,-o(f,). In particular, Kn e(o Jisa 7 bee on GT. W hen 


oe Gq, this was the Kummer 1-cocycle w.r.t. n~'") (Proposition 1. 8.1), 
Thus, in view of Anderson’s multiplication formula (§1.7 (8),), we are lead 
to define the following subgroup of GT containing Gg; 

GTA is the collection of all elements o of GT satisfying 


([] ro())( [J Pe (a)"'=1@K9)(c) (nEN, a€ Q/Z). (4), 


nb-a ne=0 


It forms a subgroup because Kee) is al-cocycle. A natural question : how 
close is GTA to Gg (or to GT). One related remark. When o € Gog, roy Yo o) 


(for each fixed r) is obviously multiplicative in n, but the author does not 
know yiether this remains valid for any o € GT. However, if o € GTA, 
then kn i (0 r) is necessarily multiplicative in n. This can be deduced easily 
by rewriting the left hand side of (4), for n = nino by performing the 
division in two steps, and by using (4), forn = nj and n= no. 

Another related remark. There is another important property of [,(¢ € 
Gg) proved in [A]; namely, Theorem 3 in [Ag](ii). But this is a formal 
consequence of his multiplication formula; hence its generalization to o € 
GT would simply be a formal consequence of (4). 


2 Proofs of the main results 


In §2.1, after some preliminary remarks on the completed group algebras 
A,, we shall provide a proof of Proposition 1.4.1. Then, in §2.2, we shall 
write down some basic formulae relating B, with f,, and in §2.3, using these 
and the 2-cycle and the 3-cycle relations for f,, we shall settle the proof of 
Proposition 1.6.1. The proof of Theorem 1 will be given in §§2.4,2.5, and 
that of Theorem 2, in §2.6. The readers are advised to look at §3.1 (Theorem 
A-1, the basic rules (i)-(v) for free differential calculus, and Theorem A-2) 
beforehand. 


2.1 


We begin with the following elementary 


Proposition 2.1.1 Let A, = Z{(Z" }] be the completed group algebra of 4 
free abelian profinite group of rank r with generators x,,:--,X-. Then, ™ 
the ring A,, for eachi (1 <i<pnr), 
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(i) xi — 1 is not a zero-divisor, 
(ii) for anyae Z, x? — 1 is divisible by x; —1, 
(iii) foranyae Z* (x — 1)/(x; — 1) 18 invertible. 


roof, (i) It suffices to show that x,—1 is not a zero divisor. Suppose y € A, 
_ y(x, — 1) = 0. The goal is to show that y = 0, or equivalently, 
hat, for any integers m,n > 1, the projection of y on (Z/m)[(Z/n)"] must 
snish. Consider the Saetnod n of y on a “finer” quotient 


(Z/m){(Z/mn) x (Z/n)"~"], 
and call €; the image of x; on (Z/mn) x (Z/n)"~'(1 <i <r). Thus, 


(Z/mn) x (Z/n)’~? = {E% -- £8"; a, €Z/mn, a; €Z/n (2S i <r}. 
Write 7 = >.,ar(E1) mo t+ where I = (i9,---,i,) runs over (Z/n)""?, 
and a;(£1) € (Z/m)[E,] is of degree < mn. Now, since n(€; — 1) = 0, we 
have a7(£1)(&: — 1) = 0 mod (7 — 1) for each I. Therefore, each a;(€,) 
must be a constant multiple of eee, * €;. But then, the image of a7(€,) on 
(Z/m){(Z/n)] must vanish (being killed by the multiplicity m). Therefore, 
the projection of y (which is the same as that of 7) on (Z/m)[(Z/n)"] must 
be 0, as desired. 

(ii) The closed ideal A,(x; — 1) contains x? — 1 for all n € Z. But 
xf -— l(a € Z) can be approximated by such elements; hence x? —1 € 
A,(x; - 1) for any a € Z. 

(iii) Apply (ii) to x? instead of x;. oO 


Proof of Proposition 1.4.1 We shall apply Theorem A-2 (§3) for the case 
Where F = Fo,2,; = x,t = y and N = Fy. It asserts that there is an 
As-isomorphism from (F4)?° onto 


T> = {(a, a’) € (Az)®*; a(x — 1) +a'(y — 1) = 0}, (1) 
Which is induced from the mapping 


#3 mn — (SE), (Sy) € (Aa), (2) 
Or Gy 
Where ab denotes the abelianization Ay = Z{[F.]] + Ag. First, let us check 
that Z. is free of rank 1 generated by (l~—y,x—1). (The only “arithmetic” 
Used in §2!) For this purpose, consider the projection F2? = (x,y) — 
Pp? — (x) defined by y — 1. The associated group algebra homomorphism 
Ay — A, has the kernel (y — 1)Ag. Therefore, if (a,a’) € Z2, then the 
Image of a(x ~ 1) on A, vanishes. By Proposition 2.1.1 (i) for r = 1, the 
image of a on A; must vanish; hence a must be divisible by y — 1. Similarly, 
@’ is divisible by x — 1. Since (x — 1)(y — 1) is also a non zero-divisor 
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(Proposition 2.1.1 (i)), this proves that Z» is free of rank 1 generated by 
(l-—y,x—1). J 
On the other hand, ifn = (x,y) = zyz~!y~! € F4, then 


On —1 ab 
Ben oe l—zyr'—l-y (3) 
dn ~1,-1 46 


—— = ©£-xryr y x—1; (4) 


hence n (mod F%’) corresponds with (1—y,x— 1). Therefore, (z, y) Bives 
a free generator of the Ao-module (F4)®. O 


2.2 


Now we shall present some basic formulae connecting f, (§1.2) and B, 
(§1.4) for any o € GT. Roughly speaking, Bz is the abelianization of some 
derivative of f,. By the profinite Fox theorem applied to F = (z, y), every 
element A of Az = Z[[Fo]] can be expressed uniquely as 


A= s(A) Al (2-1) +Aa-(y—1) (A, A2 € Ad), (1) 
and Ay = 94, \. = 2 (see §3 (Theorem A-l,etc.)) In particular, for each 


oy 
O(f5*) 


ae * O(fz*) 
o a1 ae a a ea (2) 
Define an element wz of Ao by 
a gt O(f5") = gor! Oto cd 
Ye Sd Be eS Ben 1) 
= -19f¢ — — f-l -1 Of = 3 
— 1— f; Or (zx l=fe + fz dy (y iy ( ) 


This is a profinite analogue of the power series defined and studied in [Iz] 
(see also [I3] §6.5 Remark). 


Proposition 2.2.1 


fo(z,y)vo(z,y) = voly,z) (cE GT). 


Proof. The 2-cycle relation gives 


foly,2)fo(z,y) = 1. (4) 
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operating the free differentiation < (w.r.t. x,y) on both sides of (4), 
ng the formulae (i)-(v) of §3.2, we obtain 


Of Of 
Fe (y,2) + foly.2)Z2(2,y) = 0. (5) 


n the other hand, by (3) (the second row equalities), we deduce both 


fal, v)volt,u) = fol2,y) — 22 (x, ye v), (6) 


Ofe 


ve(y,z) = foly,z)-* 4+ foly,2) (4 Gy ew elz- 1) 


fe(2vu) + fol vy) Z2(y,2)(0— 1), (7 


We obtain the desired formula by comparison of (6) and (7) using (5). O 
The image of each element A of Az under the abelianization map Ay —> 
‘Ag is denoted by \?°. Since f, € ee we have 


ay Sl (eo SE7). (8) 
This is an obvious but crucial remark. Note also that (3) and (8) give 


va =1-(F2 yx) = 1+ (yy - (9) 


Proposition 2.2.2 By = ¥2 (o €GT). 


Proof. Put n = (z,y) = xyz !y7!, and let o = (A,f) € GT. Then 
O(n) = (2, f-ty*f). Let us compute (200), which is simpler than 
So(n)) itself, as we may replace any element of F% (e.g. fo) by 1 in the 
abelianization process. A straightforward computation gives 


O(o(n))  _ z*-1 Of a, OF ae ye* 1 
is 2 a fee mer ee ee! gl 
Of of 
Awe —_— 
oY ao? ae 


Where = is the congruence modulo the kernel of abelianization. Therefore, 


O(a(n)) 


(“35 


==) =y" =x) + (My 
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But since (22)2° = 1 — y (§2.1(3)), this gives 
or 


in (F4)2°, where 7 = n (mod Fi’). From the definition of B, and (9), it 
follows that 
Ofe 


By = 1-(x-1)( 32)" = yg? | 


0 


Proposition 2.2.3 Express the image of fz on (F4)° as an Ao- multiple 
of the image of (x,y) on (Pay? and call this element Cz € Az. Then 


2 -14(x—1)\(y—-—1)C, (c € GT). 


Proof. The first coordinates in Zo of the images of f, and (x,y) on (F4)” 
are (Sfe)2, resp. 1—y. Therefore, 


Ofc ab 
= l1- 
( Or ) Co( y); 
hence we obtain the desired formula by using (9). 0 
Corollary 2.2.4 Bz =1 mod (x —- 1)(y — 1) (c € GT). 


Remark From Proposition 2.2.3, it also follows that if o € GT acts oe 
ially on Fe so that B, = 1, then C, = 0 which implies f, € F' 
Therefore, if o acts also trivially on F/F’, so that \, = 1, then o must A 
trivially on F/F". 


2.3 


Now we are going to prove Proposition 1.6.1(i)-(iv). Let o € GT, and 
put m = m, = }(A, —1),B = Bg. Let r be the automorphism of A2 
defined by x — y,y — 2,2 — x, where z = (xy)7!, and write B = 
Bixy},27 = Bly,z}, B” = B{z,x}. Then one checks easily that (i) ~ 
(iv) are respectively equivalent with the following 

[i] B{x,y}=1 mod (x-1)(y-1) 

Gabi eo = A eras 

li] B(x, y) = Bly, x} 

liv] B{x, y}B{z,27!} is symmetric in x,y,z. 


IHARA: On beta and gamma functions 163 


roofs of [i] ~ [iv]. 

fi] This is Corollary 2.2.4 just proved. 

[iii] By Proposition 2.2.1, fo(z,y)Wo(z,y) = Woly,z). But poh: =]. 
hence V2" is symmetric in z,y. By Proposition 2.2.2, this gives [iii]. 


To prove {ii],[iv], we need further the following 


“Lemma 2.3.1 The notations being as above, we have 


y7(1— xy) B{x,y} + (x—1)Bly,z}+x™ y™(y — 1) B{z,x} =0. (1) 


Proof of Lemma 2.3.1. This follows from the 3-cycle relation for f = fo: 
f(z,z)z™fly,z)y™flz,y)je™ = 1, (2) 


by taking Ss of both sides (the free differentiation w.r.t. z,y), then pass- 
ing to the abelian quotient, and then by using the following equalities (cf. 
Proposition 2.2.2 and §2.2 (9)): 


B{x, y}(= o™) =1= (fe)? (x -1 =14+(fy)*-(y-1) (3) 


o. := ol fy = 5). In fact, by letting S operate on both sides of (2), 


using the identities = =z, oftee) = —f,(z,x)z+fy(z,z), and by passing 
to the abelianization, we obtain 


7™ — | 
(— 2" (2.x) + F7"(2%)) — 2 — Py, 2) 
1 
+x7™ f2?(x, y) + x7™ hai 0. (4) 


By multiplying (x — 1)(z — 1) on both sides of (4) and by using (3) (to 
express f2?, {= in terms of B), we obtain (term by term) 


(B{z,x} —1)(xz-—1)-—2(z™ —1)(x-1)- z™*l(x — 1)(B{y,z} — 1) 
+x7™(z —1)(1— B{x,y}) + (2 -1)1—-x7™) =0. (5) 


We see that the sum of terms not involving B{,} vanishes, and by multipli- 
cation of —x™+ly™+1 on both sides of (5), we obtain the desired formula 
(1). O 


Remark We only obtain the same formula if we use aT instead. 


Proof of Proposition 1.6.1 (ii), (iv). Now by applying the involutive 
automorphism x — y,y — x of Ag to the formula (1) of Lemma 2.3.1, and 
by using the already established equality Bfy,x} = B{x,y}, we obtain 


/ 


x™(1 — xy) B{x, y} + (y — 1)Bf{z,x} + x™y™71(x - 1)Bfy,z} =0. (1) 
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m+1 ..m 


Multiplication of x y™ on both sides of (1)’ and subtraction from (1) 
give 4 
y™(1—x*)(1 — xy) B{x, y} + (x — 1)(1— x*y*)B{y, 2} = 0. 

Therefore, 
(1— x*)(1—xy) 
(1 ~x)(1—x*y*) 


By letting z — 1, we obtain 


B{y,z} = y™ - B{x,y}. (6) 


l—-x 


B{x,x7'} = Ax™=——5, (7) 


which is equivalent with [ii]. Moreover, (6) and (7) give 
Bly, z}B{x,x""} = B{x,y}B{z,27"}, (8) 


which implies that (8) is symmetric in x,y,z, whence [iv]. O 


2.4 


Before going into the proof of Theorem 1, we shall first show why the two 
statements in Theorem 1 are easy formal consequences of each other, under 
Proposition 1.6.1 (iv). 

Let o € GT, and write B(s,,s2) = Bz(s,,s2). First, assume that 


B(s1,52)B(s, + $2,583) (1) 
is S3-symmetric, and put sq = —(s; + s2 + $3). Then 


2)B(s3, $4) B(s1 + 82,83 + 4) (2) 
$2)B(s, + 52,54) B(s3, -s3) (by (iv)) 

s9)B(sq + 82, $1)B(s3, —s3) (S3-symmetry of (1)) 
eta $1) B(sq4 + $2,583 + 51) (by (iv)). 


Therefore, (2) is invariant under the transposition (1,4). Since (2) is ob- 
viously invariant under (3,4) and (1,2), it follows that (2) is S4-invariant. 
Secondly, assume that (2) is Sy-symmetric. Then 


B(s,, $2)B(s, + $2, 83)B(s4, —S4) (3) 
= B(s,,82)B(s3,s4)B(s, + $2,53 + $4) (by (iv)) 
= B(s3,52)B(s,,84)B(so + 53,51 + 54) (S4-symmetry of (2)) 
= B(s3,$82)B(so +53, 8,)B(s4 — 54) (by (iv)). 


Therefore, (1) is S3-symmetric. 5 
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ow we are going to prove Theorem 1 (the S4-symmetry of (2)). Our proof 
{| use a G7-action on a suitable subquotient of BS. First recall ({D][I-M]) 
hat GT acts on the profinite completion B, of the Artin braid group By 
or any n > 4, in the following manner; 


o(b;) = bt, o(b:) = f(pi, 67) bs F(pi,b?) (2 <i <n-1). (1) 


Here, o = (A, f) € GT, and 6;,p; € e are as in §1.1. This GT action on 
B, and the trivial GT’ action on the symmetric group S, are compatible 
with the projection B, — S,, and hence GT acts on its kernel which is Py. 
Mach go = (A,f) € GT Piees each of the standard generators pi; of Py to 
some P, -conjugate of Pi; because pj; is B n-conjugate with pyo = b? ando 
acts tnwiolly on By/ Pn. Moreover, we have o(pi) = pp (2 <i <n) and 
o(wn) = wa (cf. [I-M] A.3); hence the above GT actions on Ba, Pe induce 
those on Br, P*. Its action on Px exactly corresponds to the original GT 
action on Fo, va the ss araore hier 


Pi = Fh; Pig = P34 -— Z, (2) 
P93 = Pig Y, 


Pig = Pyq —>* z = (zy)? 


Now let N be the kernel of the canonical homomorphism Pe — Py defined 
by pis — 1(1 <i < 4), ph, — pil < & < 1 < 4). It is the homomorphism 
defined by removing the 5-th string. Then N is generated by t; = piz(1 < 
1 < 4) (not just normally but also) in the usual sense, because, as is well- 
known and easy to see, every Pz-conjugate of t; is an N-conjugate of t;. 
Moreover, N is free of rank 3 on t;,to,t3, and we have t,tot3ty = 1. Since 
N is free, so that N is center-free, the induced homomorphism Ieee P: is 
still injective ([A"] Proposition 3), and so we shall consider N as a (normal) 
subgroup of Pr. We have an exact sequence 


WR | (3) 
F3 = (ti, to, ts) Fy 


of profinite groups, on which GT acts compatibly, because N is GT-stable 
and the standard embedding Px — Pz which splits (3) is compatible with 
the GT-action by the formulae (1). Now define an open subgroup H C B5 
as the unique intermediate group Pz C H C Bz that corresponds to the 
stabilizer of the letter 5 in Ss = BZ/P*. Then N is normal in H, and hence 
H also acts on N by conjugation n — hnh7!(h € H,n € N). We denote 
by s : H — Sy (on {1,2,3,4}) the canonical projection. The key point of 
the proof is the study af the actions of GT and H on N’ [R, and to see 
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how close to being “mutually commutative” these two actions are. (The 
quotient Pz /Pz is useless for this purpose.) But first, we need: 


Proposition 2.5.1 N2 = N/N’ is a free abelian profinite group of ran 
3 generated by the images t; of t)(1 <1 < 4) which satisfy a single relation 
tytotgtz = 1. The groups GT (resp. H) act on N® by o(t,) = ti? (resp. 
h(t;) = tsnyi) (0 € GT,h € H,1 <i < 4). In particular, the normal] 
subgroups GI, = {o € GT;A, = 1} (resp. PS) of GT (resp. H) agy 
trivially on N2?. 


Proof. The first statement is obvious. As for the second, since a(t;) = 
ote g, with some g; € Ps for each i(1 < i < 4), and gitig; * is N- 
conjugate with t; (as noted above), o(t;) is an N-conjugate of t?*: hence ¢ 
acts on N® as the A,-th power map. On the other hand, ht;h-1(h € H) is 
an N-conjugate of t,,);. The rest is also obvious. Oo 
_ Now let us introduce the completed group algebra Z[[N]] of N®. Since 
N® is a free abelian profinite group of rank 3 generated by t;(1 <2 < 3), 
where t;(1 <i < 4) denotes the image of t; € N on N®, we may identify 
Z([N2]] with Az = Z[[Z3]], via the basis t,,to,t3 of N@, and call it As. 
Thus, A3 = Z[[N2>}] and tytet3t4 = 1. Using t;(1 <i < 4), we can embed 
Ag into the ring 


4 


Fet({(s1, 52,53, $4) € (Q/Z)*; ey =0)5 Z® Qas) (4) 


i=l 


of functions as described in §1.5. Let A be the total ring of fractions of 
Az, obtained from Ag by allowing all non zero-divisors as denominators. By 
Proposition 2.1.1, 1 — t(t € N®*) is a non zero-divisor of Ag if t can serve 
as one of the free generators of N°: for example, t = t;, tit; (2 # 7), etc. 
The actions of ¢ = (A,f) € GT and of h € H on N® extend naturally 
to continuous ring automorphisms of A3 and Aj. The o-action on A is 
t; — t\(1 <i < 4), which will be denoted by (\). The h-action on A} is 
the substitution t; — t,c,);, denoted also by s(h). In terms of functions on 

(Q/Z)%, the first action is the multiplication of \ on (s1,°--,$4), and the 
second is the corresponding substitution of indices of s1,---,s4. : 

_ Now let us turn our attention to the next commutator quotient N= 
N'/N". By the conjugation z + frf~!(z € N’, f € N),N ® is considered 
as an N?>-module, and hence also as a module over A3 = Z[[N 2°] in the 
natural way. By the profinite Blanchfield-Lyndon theorem (Th A-2), we 
can (and shall) identify N° with the 43-module 


3 


T3 = {(a1, a2, a3) € ARE SF a(t; — 1) = 0}, (5) 


R=). 
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On 


= ai 


ab ab 
Wn = (SE), (SE). (SEI) € Ts, (6) 
where eS eee 
Ag = ZN] & ZN] = As. (7) 
is the projection induced from the abelianization of N. 


Put vs 
VEN @e2Ao = Ts Oa, AS 


Proposition 2.5.2 V 1s a free A3-module of rank 2 with basis 
e = (ta,t3) (mod N”), e! = (t2,t;) (mod N”). 


Proof. We shall compute the “Z3-coordinates” of e,e’. First, (f4.43). = 
((titotz3)—*,t3) = t3't7\t7'tatite; hence 


| Mate = assist tistgtstis S ta + tata, 


Ot; 
Atets)  .—1,=1 —1,-1,-1 ab -1 
| “Si = tg ty +tg ty ty tat,  —tytyt ty’, (8) 
G(ta,ts pee wel, 1s 1 ee tage ge 
( Bits agg tg tg ty Se tg es, 


hence e has the coordinates 
e = ta(tz — 1,t)(t3 — 1),1— tyta). (9) 
As for e’, the computation is even easier; 
e’ = (tz —1,1—ty,0). (10) 


Since 1 — t;t2,1— +t, are non zero divisors of A3 and hence are units of Ag. 

it is now clear that {e,e’} is a free A9-basis of V. O 
Now consider the special eiesheuis b¥(1 < i < 3) of BE (§1.1). Since 

s(b;) is the transposition (i,i+ 1) and i < 3, each such 6% belongs to H. 


Proposition 2.5.3 (i) The groups GT and H act on the A-module V 
semi-linearly, i.e., (av) = a?-)o(v), A(av) = al) h(y Me E€E GT,he 
Hae ASve V). In particular, GT; and Pr act on V4 A3-linearly. (ii) 
For any o € GT and1<i< 8, there exists some 6 € Pz such that 

gobt =$0(b*)*" og (11) 
on V. 


Proof. (i) is obvious, and so is (ii) if we put 6 = o(b*)(b)~>. Note that 
o(b*) is a (Pz)'-conjugate of (b*)** (as f, € Fy) and hence that 6 € (Pz). 
CO 


Now we shall compute the actions of o and of 65 on the basis e, e’ of V. 
First, we need 
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Proposition 2.5.4 For each o =(\,f)€ GT and1<i< 4, we have 


o(t;) = njt}n>} 


with n; € N(1 <21< 4) given by 


ng=1, ng=tgf(ts,t3)7', no = f(tite,ta)(tit2)7 Ff (te, th), 
ny = no -ty f(te,ti)7*, (13) 


Proof. By almost straightforward computations using the following basic 
formulae, where 6; = b3(1 <i < 4), 7; = pt(2<i< 5). 


o(A1)=Bt, (Bi) = F (mi, BF)" BP (mi, 8?) (2S < 4), (14) 
ta= 6g, ty = (Ba--- Big) B7(Ba-- Big)? (DSi <3), (15) 


and the obvious sphere braid commutation relations: 


meaty 
Bam38q = 73, (B403)73(G4G3)~! = tito, (16) 
(G483)m2(8a83)~' = m2, (848382)72(848382)7! = ty 


The following remarks are also used. Since f € Fe. we have f(£,7) = 
1 if € commutes with y, and also f(€¢,n) = f(€,¢n) = f(E,n) holds if 
¢ commutes with both € and 7 (see [I-M] Appendix A2). For example, 
f(73,t3) = f(piot3,t3) = f(ta,t3) (note that pi.73 = pi.p, = fas = ta, 
because w3 = ps = 1 on BS). Finally, the 3-cycle relation for f (applied to 
x =t to, y = t3,z = t4) is used in order to obtain the formula for no. O 


Proposition 2.5.5 The action of ¢ =(A,f) € GT on the A§-basis e,e’ of 
V is given by 


a(e,e’) =(e,e’)S, (17) 
where S = ( . ) with 
EA a = to BET Bota, ta}, (18) 
a =i" St =p tot — Bz {t2, ti}. 


Here, By{t;,t;} denotes the image of B, € Ao under the continuous ring 
homomorphism Ay — Az defined by x — ti,y — ty. 
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roof. By Proposition 2.5.4, o leaves the subgroup (#3,t4) stable, and maps 
1, t2) to na(ty,t2)n5° . Let us compare these o-actions with the original o- 
ction on F’y = (x,y) defined by x — xy = | lee a via the isomorphisms 
ty,t3) — Fo (ta —2,t3 — y) and (to,t1) — Fo (to — z,t; — y). Then, by 
the above formulae for the n;’s, we see that the action of o on (t4,t3) is the 
‘same as the original one except that here it is followed by t"-conjugation. 
“Also, the action of o on (2,1) is the original action followed by (ngt¥")- 
conjugation. Note that the image of not} on N® is t;™. So, our assertion 
follows directly from the definition of Bg. 0 


Remark Thus, e,e’ are eigenvectors of the A$-linear action of GT; on 
V. This depends on our appropriate choice of pairings e = (t4,t3),e’ = 
(to, t1) mod F%! of indices(!). 


Proposition 2.5.6 Letr=63 € H. Then 


re;e = (ee )7, (19) 
with ; 20 ' 
ae, ee -1 { ta—1, tytgt3(1—t, 

pm bale Ata) ( ty—1, ty, t3(1— ts) vo) 


Proof. Since r(t4) = ta, T(t3) = t3 ‘tats, T(t2) = tz and r(t,) = t,, we have 
Fit4,t3) = (taste tots), T(to,t,) = (t3,t:). We can compute r(e),7(e’) by 
writing down the abelian images of x (1 <i < 3) of these commutators. 
First, since 


t(ta,t3) = tz '(tgtatz',t2)ts = te (te tte tats, (21) 


tz -r(e) is the abelian image of (S-(t7 ty tz Mtotat1))r<i<s, which is equal 
to (—tr't7'+ty',-ty7'4+t4,-tet+ty'tz+). Hence r(e) is 
ta(t2 — 1,-t; + t3',-ty' + totz?). (22) 
Similarly, r(e’) is the abelian image of (32 (ts, t1), 0, a-(ts,t1)), a 
(t3 — 1,0,1—-¢,). (23) 
Our formula follows by comparing these with the coordinates of e,e’ given 
by (9), (10). Oo 
Now, to complete the proof of Theorem 1, apply o = (A, f) € GT on 
both sides of (19), which gives 


(cr(e),or(e’)) = (a(e), o(e’))T™ = (e,e’)ST. (24) 
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But since or = 6r%o with some 6 € Pr (Proposition 2.5.3), and Since 


rT? = pas € Pz acts A3-linearly on V as 


r(e,e’) = (e,e)T- TO), 
where the superscript (2,3) indicates the action of the transposition (2,3) 
on the indices of t;’s, we see that (24) is also equal to 
(6r°™)r((e, e’)S) = 6(7°™(e,e’)T -S'°*)) = (e,e)D(T - TI) ™ TS (2,3) 


__ (24)! 
where D € Mo2(A$) is defined by 6(e,e’) = (e,e)D. But since PF acts 
A§-linearly on V, and since 6 € P3 , we have 


det( D) = 1, (25) 


Now by (24)< (24)’ (since e,e’ form an A$-basis of V), we obtain 


ST) = p(T Te?) )™ 7g), (26) 
hence by (25)(26), 
det S = det(T) det(T~!)™ - det(TT?)™ det (S)?*), (27) 
But since 
det T = -tz*(1 — t,t2)7*(1 — tits), (28) 
we obtain 
det (TT?) = (tat3)7", (29) 
and 


1-18 tatits is 


¥ _ jen 2 
1—tyte 1— t7t3 


det(S) = t det(S)'?9), (30) 


Therefore, by Proposition 2.5.5 


mi ttt? 1— tits 


Bo {t2, ti }B,{ta, ts} = tH ts 7 Bo {t3, ti}Bo (ta, to}, 


(31) 


l-—titel— te ay 


or equivalently, that 


tito 


r- (tuto) ape Be o{t4, t3}B,{te, ti} (32) 


is invariant under the transposition (2,3). But this implies the Sy-invariance 
of (32), and hence (in view of the formula §2.3 (7)) the second formulation 
of Theorem 1. This completes the proof of Theorem 1. 
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6 
roof of Theorem 2. We keep our notation of §1.8. Let o be any ars of 


7. By using Proposition 1.8.2 and Theorem A-2, we shall relate as Va) 
sith the derivative of B,. First, by Theorem A-2 apolied to NV = An, 


a ah—( (3) of 28) Wer MiP ater" (1) 


ROLY jedan NOU J Yapeny) 
nduces an (A;/(x” — 1))-module isomorphism 

Hy ae {(as8) © (Ai/(x” — 1)"; a(x = 1) = 0}, (2) 
Note that (2) maps the images on H2° of 2", z7Jyz? (0 <j <n) to 


0} ,(0 ,(0, x77) , 


pe 
\x-1’ 


respectively. On the other hand, the image of f, is mapped to the special- 
ization of ((f,)2°, (Severs over y — 1,2” — 1. But since 


By = vo" = 1~(fo)e'(x-1) =1+(fo)y (y - 1) (3) 
(§2.2), this specialization has the second coordinate equal to 


Bz-1 


\ y = 1 / Vee ll |x "ed 


This means that if we temporarily write 


Bes 7 a 6 
( y-l Dc ~ 3 ajx? (a; € Z), (4) 


j{ mod n) 


then the image of f, on H2° is given by 


pnt; I] (Fy? )% 


j( mod n) 


with some 6 € Z; hence the homomorphism Hy — foo of Proposition 1.8.2 
maps f, to exp(27/—Ira;). Therefore, Kt) (c) = exp(27/—Ira;); hence 
In particular, 


KO (e) = Ce. (5) 


n,l/m 


By Proposition 1.8.3, we obtain a; = w(c). Therefore, 


(Bo = = ((g)x7? mod (x” — 1 6 
(SS). = So wh(c)x ( ) (6) 


2( mod n) 
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B,-1 
y yal 


Now we compute the left hand side of (7). With the notation of §1.8, the 
equality B,(s,,s2) = [',(s,:)(s2)o(s1 + s2)71 can be expressed as B, 
=T,{x}l,{y}l.{xy}7'. Therefore, 


{ Be) (Potx}Poty} - Pofxy} \ 


for all n € N; hence 


Ry a cs \ I,{xy}(y — 1) = (8) 
(Tofx\Co{y - 1)) 

\ Pr, {xy}(y — 1) Paes 

(T.{xy} —T.{x}\ 


me lr, {xy}(y — 1) ye 


= —(Dlog!,)x=1 + DlogIg. 
Our theorem follows directly from (7)(8). Oo 


3 Appendix 


Profinite free differential calculus and profinite Blanchfield-Lyn- 
don theorem. 


3.1 


In the main text, we made use of the profinite versions of two classical 
theorems on free groups, a theorem of Fox and that of Blanchfield-Lyndon. 
In this Appendix, we shall provide their general formulations and proofs. 
Their “almost pro-l” versions have been treated in {I2]. The profinite version 
of a theorem of Fox (Theorem A-1 below) was, as far as the author knows, 
first proved by G. W. Anderson (a letter to the author [Aj], unpublished) 
using intersections of cycles on finite branched coverings of the complex 
projective line. Here, we shall give simple algebraic proofs for this, and also 
for the profinite Blanchfield-Lyndon theorem (Theorem A-2). 


3.2 


Let ¥ = F, be a free profinite group of finite rank r > 1, and 2,,---,z, be 
a set of free generators. We denote by A = A, = Z[[F]] the completed group 
algebra of F. It is a profinite ring obtained as the projective limit of finite 
group rings (Z/n)[F/N], where n (resp. AN’) runs over all positive integers 
(resp. open normal subgroups of F). Let s: A — Z be the augmentation 
homomorphism which extends the homomorphisms (Z/n)[F/N] — Z/? 
defined by “taking the sum of coefficients”. 
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feorem A-1 (G. W. Anderson). Every element of A can be expressed 


A= s(A)-14+ 0 A(ei- 1) (1) 


ay 


ith Nis a Ar EA. 
This was communicated to the author by G. W. Anderson while we were 
orking on the joint papers [A-I], trying to loosen our pro-! assumption 
ade there. His proof [A] is closely related to a topological method used 
there. But here, we shall provide a straightforward algebraic proof. 
For each 2(1 <2 <r), assuming Theorem A-1, define the map 


=—:A—A by =r. (2) 


he following properties of sS — (1 <i<_r), which follow directly from the 
definition, are used in the man Léext. 


(i) ae is continuous, Z-linear, and At) =U, 
Or; Cz; 
zs Ox; $y a 
(il) tec bij (1S i,j <1). 
wis OLA) = OA 51) On , ea 
(iii) ios. Saat (A) + ieee a (A, A" € A); in particular, 
Oxy 
(iv) ¢-4 as" a gt of for any fe F062. 


(v) For any \ 2 a and ois € F, we denote by X(y1,---, y,) the 
image of X under the endomorphism A — A induced from the homomor- 
phism F — F defined by x; — pi (1<i<r). Then 


a =~ en : 
jai Men) = DG Men ee) ay 


g=1 On 


Theorem A-2 Let N be any closed normal subgroup of the free profinite 
group F of rank r on 2,--:,2-. Put G=F/N, and let A = Z[[F]] (resp. 
A = Z{[G]]) be the completed group algebras of F (resp. G). Denote by 
™:A — A the canonical homomorphism. Then the mapping 


On On 


RS Bag je An 


)) € Am (3) 
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induces a left A-module tsomorphism 


r 


N= {(ay,-+-,a-) € A®"; S$” a;(a(2;) — 1) = 0}, (4) 


"1 


where N® = N/(N,N) 1s the abelianization of N, whose left A-module 
structure 1s the one determined by the G-conjugation on N° induced from 
the F-conjugation 

n—fnf~' (nN, f €F) (5) 


on N. 


3.3. Proof of Theorem A-1 


Let F = Fy, be the free group of rank r on z1,---,z, and F = F, be 
its profinite completion. Let Z[F] be the group ring of F over Z, and 
s : Z[F] — Z be the augmentation homomorphism. Then, as is well-known, 
each element @ of Z[F] decomposes uniquely as 


6 = s(8)-1+ 5° A(x; - 1) (1) 
t=1 


with 6,,---,@, € Z[F], and the Fox free differential operators = : Z[F] 


ZF] are defined by on = 6; (1 < i <r). The following properties of 
ae (1 <i <r) follow directly from this definition. 

(i) & is Zlinear, and [2 =0. 

(ii) F = 55; (1<j<r). 

(ii) = 2 so’) +0 (0,0 ZF). 

Key lemma _ Fir i(1 <i<pr), let n be any positive integer, and N be 
any normal subgroup of F with finite indez. Then there is a unique additive 
homomorphism 0; = Oin.n which makes the following diagram commuta- 
tave. Here, the vertical arrows p,,p2 are canonical homomorphisms. 

Z(F] na ZF] 
Pi | p2 | (2) 


(Z/n)[F/(N,N)N"] 22" (Z/n)[F/N]. 


In short, st composed with the projection Z[F] — (Z/n)[F/N] necessarily 
factors through (Z/n)[F/(N, N)N”]. 


Remark Note that N is finitely generated; hence N/(N,N)N” is finite, 
and hence F/(N, N)N” is also finite. 
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ou 


roof. Put Pi = poo Bo Then 

pi(96") = pi(9)s(") + po(8)oi(0') (8,8' € Z[F}). (3) 
Consider the restriction of »; : Z[F] — (Z/n)[F/N] to the multiplicative 
group NC Z|F]. Then, for any n,n’ € N, since s(n’) = po(n) = 1, (3) 
gives yi(nn') = y;(n)+;(n’); hence ¢;|N is a homomorphism from N into 
the additive group (Z/n)[F'/N] which is abelian and killed by n. Therefore, 
gilN factors through N/(N,N)N”. Now consider the restriction of y; to 
F; 

vilp : F — (Z/n)[F/N], (4) 

and consider (Z/n)[F'/N] as a left F-module. Then ¢;|r is a 1-cocycle (by 
(3)), and factors through F/(N,N)N”. Extend this by linearity to 


0; = Oinww :(Z/n)[F/(N,N)N"] — (Z/n)[F/N]. (5) 


Then, by construction, poo a and 0; ¢ p; coincide on F, and by linearity, 
also on Z[F']. The uniqueness statement is obvious by the surjectivity of py. 
O 


It is obvious that 
O;(80") = 0;(8)s(8") + p(9)0;(6’) (6) 
(9,0 € (Z/n)[F/(N,N)N"]), where p() is the image of 8 on (Z/n)[F/N]. 


By taking the projective limit of 0; = Oin,n for all n,N, we obtain a 
continuous additive map 


8; : ZF] = ZF] (7) 
satisfying 0;(A4’) = 0)(A)s(A’) + AOi(’) (A,X’ € A), O:(1) = 0,0;(z;) = 
6; (1<j<1r). Note that 0; = oon on Z[F] Cc Z[[F]]. 

Now, to complete the proof of Theorem A-1, consider the continuous 


map ¢ : Z[[F]] — Z{[F]]] defined by 
(A) = A—(s(A) + 0 Ai(A)(ai - 1). (8) 


Then it is continuous and y(A) = 0 on the dense subset Z[F] of Z{[F]]. 
Therefore, y = 0 everywhere. Therefore, 


A= s(A)+ » At; 1) (9) 
‘1 
holds for any \ € A if one putsA; = 8;(A). Conversely, if (9) holds for 


Some A,,---,A- € A, then by applying 0; on both sides of (9) we obtain 
O;(X) =; (1<j <r). This settles the proof of Theorem A-1. 
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3.4 Proof of Theorem A-2 


First, we shall prove the following 


Lemma Let J = Ker(s) be the augmentation ideal of A = Z((F)], and f 
be any element of F. Then f —1 EI? tf and only if f € F’ = (F,F). 


Proof. The “if” imENcauon is obvious. To prove the “only if” implication, 
decompose f as f = x{'---r2'g, witha; EZ (1 <i<r)andge Fr 
so that g —1 € I?. Since slements of J? can be approximated by some 
ony combinations of eae of the form iz’ (2,7’ € I), and since Se = 


Sz, € I, it follows that == ve I for any \ € J*. In particular, gh, gt oa EI. q 
a 
f) z;7—1 a 
0=— = pyle TS i + 2%,..g¢r 9 (1) 4 
OL; + it Or; 
= 4a; mod I. 
Therefore, aj = 0 (1 <j <1); hence f=qg Ee F’. oO 


Now we shall prove Theorem A-2. It is along the same line as in the 
proof of Theorem 2.2 [I] (the almost pro-l version). Let A denote the 
kernel of t: A — A. It is a closed two-sided A-ideal contained in J, such 
that A/K — A. Denote by K’-I the closed two-sided A-ideal generated by 
elements of the form ka (k € A,i € I). Then the left A-module structure 
of K induces a left A-module structure of K/K? and also of its quotient 
K/K-I. Our proof of Theorem A-2 will be divided into two steps. They are 
to establish two A-isomorphisms 


r 


K/K-I A {(a1,...,ar) € AP; ) | ai(m(2z) ~ 1) = 0}, (1) 
41 
and 
N® = K/K-1, (1)’ 


whose composite gives the desired A-isomorphism. We denote by Z the left 
A-module on the right hand side of (1). 


Proof of K/K-I ~T. 


Consider the map 


IHARA: On beta and gamma functions 177 
Ve shall see that (2) induces A/A-I = TZ. First, take any k € K.i € I. 
Then since s(z) = m(k) = 0, 


(ki) _ Ok | alt 
an: Fm os Ad cs 


A(z) 
Oz; SR Gr: (3) 


is mapped to 0 by z. Therefore, (2) factors through A’/A-I. Since 


Ok 


k= 3 (55, ) 
j 


— 1) (4) 


and 1(k) = 0, the image is contained in Z. Therefore, (2) induces a mapping 
K/KI—-T, (5) 


which is an A-module homomorphism, because for any A € A,a = (A) € A, 
we have a(S) = m(A da,) = am( Se ) (<< Fer). O 
The injectivity of (5). If k lies in the kernel of (2), then = € Kt for all 3; 
hence k = S°"_, he (2; -lhEe KI 

The surjectivity of (5). Take any coer in Z, and for each 7, choose 
a; € A which lifts aj. Put k = aS 1 4; (2; — 1). Then since r(k) = 0, we 


have k € KX. On the other hand, a; = hence aj = n( ge ao Therefore, 


on 
(a;) is the image of k mod K-I. 
Proof of N® — K/K-I 
This will be induced from the map 
N3n—n-1mod K-I€ K/K-I. (6) 


First, since nn’ —1 = (n—1)+(n’-1)+(n—1)(n' — 1) = (n—-1)4(n'—- 
1) mod K-J, (6) is a group homomorphism into the additive group of K/KJ 
which is abelian. Therefore, (6) induces a group homomorphism 


NOP a6 RIT (7) 


Moreover, ifn Ee N, f €F, then fnf~!—1= f(n—1)+f(n—1)(f-!- 1), 
andn—1éK, f-'—1€ J; hence fnf~!—1 = f(n—1) mod KI. Therefore, 
(7) is an A-module homomorphism. 

It is clear that the composite of (7) and (5) gives the desired A- 
pene in Theorem A-2. So, it remains to prove the bijectivity of 
7). 

The surjectivity of (7). As additive topological group, A’ is generated 
by elements of the form (n — 1)f (n € NV, f € F). (Note that f(n - 1) = 
(fnf-}—1)f.) But (n—1)f = (n—1) 4 (n—1)(f —1) = (n—- 1) mod KL. 
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Therefore, the image of (7) is dense in K/AI. But since VN? is compact 
and (7) is continuous, (7) must be surjective. a 


The injectivity of (7). This is to show thatnE N,n-1EK-] implies 
n€ (N,N). We shall first prove this when A is an open (normal) subgroup 
of ¥. In this case, 


F = | | Nev, C=] (8) 


and A = @,Z[[N]]ep = Z[N]] @ ys ZW] (ev —1). Therefore, 7 ~ 
In ® Bysi1 Z[[N (cv — 1), where Iy is the augmentation ideal of Z((V]]. 
Now let n € N be such that n—1€ K-I. But since K = Ly-A, we have 
n-lé€Iy lI =I @@,5, Iv (cv — 1). Since n— 1 € Z[[N]] and the above 
decomposition of A is unique, this implies that n-—1 € I. Now, since W 
is an open subgroup of F, AN is also free of finite rank. Therefore, by the 
above lemma, we conclude that n € (N,N). This settles the case where AV’ 
is open. 

Finally, in the general case, let n€ N,n—-1EK-I. Let f:N —6 
be any continuous homomorphism onto a finite abelian group. It suffices to 
prove f(n) = 1. Now the kernel of f being open in N, there exists an open 
normal subgroup U of F such that flwaw = (1). Put M* = NU which is an 
open normal subgroup of F, and let A™ be the A-ideal corresponding to N*. 
Then since n—1€ K-I C K*-I, we obtain n € (N“*,N“). This implies that 
if we extend f to a homomorphism f* : N* = ® by N*/U — N/N NU, 
then f*(n) = 1. Since f(n) = f*(n), this proves f(n) = 1, and hence that 
n€(N,N). O 
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Arithmetically exceptional functions 
and elliptic curves 


Peter Muller * 
March 5, 1999 


Abstract 
Let f(X) € Q(X) be a rational function. For almost all primes 
p we can reduce the coefficients of f and consider f, := f mod p 
as a function on the projective line P'(F,) = F, U {co}. Here we 


continue the arithmetic aspects of joint work with Guralnick and Sax], 
and classify the functions f such that f, is a bijection for infinitely 
many primes p. This is the rational function analog of the classical 
conjecture of Schur (1923), solved by Fried (1970), which considered 
the case that f is a polynomial. 

Thereby we also answer a question of J. G. Thompson about the 
minimal field of definition of a certain rational function of degree 25. 


1 Introduction 


A classical problem going back to Schur [Sch23] is the following: Let f(X) € 
Z|X] be a polynomial, which induces a permutation of the residue fields Z/pZ 
for infinitely many primes. Then Schur conjectured (and proved this for 
prime degree polynomials) that f is a composition of linear polynomials and 
Dickson polynomials D;(a, X ), which are best defined implicitly by D,(a,Z+ 
a/Z) = Z* + (a/Z)* for a € Q. Schur’s conjecture has been proved by M. 
Fried in [Fri70], see also [Tur95] and [Miil97]. The obvious generalization of 
this question to number fields poses no difficulties, result and proof are the 
same. 

In recent joint work [GMS97] with R. Guralnick and J. Saxl we inves- 
tigated the rational function analog of this question over number fields J’. 
Let f(X) € A(X) be a rational function, and Ox be the ring of integers of 
i. Fix coprime polynomials r,s € Ox[X] with f = r/s. The coefficients 
of f = r/s can be reduced modulo all but finitely many prime ideals p of 
Ox without making s trivial. Such a reduced function induces a map on the 


“Supported by the DFG. 
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rojective line P'(Ox/p). We say that f is arithmetically exceptional if this 
induced map is bijective for infinitely many prime ideals p. 

It follows from this definition that if an arithmetically exceptional func- 
tion is a composition a(d(X )) of two rational functions a,b € K(X), then a,b 
are also arithmetically exceptional. (In contrast to the polynomial case, the 
converse does not hold even over Q, see [GMS597, Corollary 7.4].) So we can 
and do restrict to indecomposable functions. Define the degree of f € K(X) 
to be the maximum of the degrees of numerator and denominator in a re- 
duced fraction. The degree is the same as the degree of the field extension 
K(X)/K(f(X)). 

The aim of this paper is to classify the arithmetically exceptional func- 
tions over Q. This also answers a question of J. Thompson [Tho90] raised in 
a different context. 

The classification is in terms of the geometric monodromy group oe 
the branching type. Let f be such a function, then Gal(f(X) — t/C(t) 
the geometric monodromy group of f, where Gal( f(X) — t/C(t)) ie 
the Galois group of R(X) — tS(X) over C(t), when f = R/S is a reduced 
fraction of ern Further, for the finitely many points 6,,...,6, € 
CU {oo} with |f—'(6)| < n = deg f let m; be the least common aaltiple 
of the multiplicities the points in the fiber f~'(6;). Then (m,,... ,m,) is 
the branching type of f. The type, together with the geometric monodromy 
group, usually gives precise information about the function f. See Section 
3 for more details; in particular the numbers m; will be seen as the orders 
of the elements of a very specific generating system of Gal( f(X) — t/C(t)). 
The result, which completes work from [GMS97], is 


Theorem 1.1. Let f € Q(X) be an indecomposable rational function of 
degree n which is arithmetically exceptional over Q. Set G := Gal(f(X) - 
t/C(t)). Then one of the following holds, where p is an odd prime and Cy 
denotes a cyclic group of order m. 


(a) n = p, G is cyclic of type (p, p); 
(b) n=p> 5, G is dihedral of type (2,2, p); 
(c) n=e4, G=C, x C2 of type (2,2, 2); 


(d) n= pe {5,7, 11, 13,17, 19, 37, 43, 67,163}, G is dihedral of type 
(2, 2,2, 2); 


(ec) n=p*, G=(C, x Cy) » Ce of type (2,2, 2,2); 

(i) w=" ,.G = (Cs x Cs) 3S) 0f-type'(2,3,10); 

(9) n= 5". Qe (Cs x Cs) 6 (Ce. C2): of type (2,2, 2,3); 

(h) n = 37, G = (C3 x C3) 4 (Cy x C2) of types (2,2,2,4) and (2, 2,2, 2,2); 
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(1) mn = 28, G@ = PSLo(8) of types (2,3,7), (2,3,9), and (2,2, 2,3); 
(j) n= 45, G = PSL2(9) of type (2, 4,5); 


While dealing with the arithmetic of case (g) above, we solve a question 
raised by John G. Thompson [Tho90} about the minimal field of definition 
of a certain rational function of degree 25. 


Acknowledgment. | thank G. Malle and B. H. Matzat for a careful reading 
of the manuscript. 


2 Arithmetically exceptional rational 
functions 


Let B bea finite permutation group on 0, and G<B be a transitive, normal 
subgroup. We say that the pair (B,G) is exceptional, if none of the orbits 
# {w} of a point stabilizer G, is fixed by B,. (This is of course independent 
of the chosen point w € 2.) This notion of exceptionality has first appeared 
in arithmetic questions of finite fields, see [FGS93] and the literature given 
there. 

If the finite group A is acting on 2, and GSA is transitive, then we say 
that (A, G) is arithmetically exceptional, if there is a group B with G < B< 
A, such that (B,G) is exceptional and B/G is cyclic. 

Now fix a number field K, and let f € A(X) be a non-constant rational 
function. Let ¢ be a transcendental over A’, and let L be a splitting field of 
f(X)—t over the rational function field A(t). Denote by K the algebraic clo- 
sure of A’ in L. Then A := Gal(L/K(t)) is called the arithmetic monodromy 
group of f, and G := Gal(L/K(t)) is called the geometric monodromy group 
of f. Except otherwise said, we regard A and G as permutation groups on 
the roots of f(X) —#. We call K the field of constants of f. Note that 
A/G = Gal(K/K). 

The following group-theoretic characterization of arithmetically excep- 
tional functions is due to Fried [Fri78], see [GMS97, Theorem 2.1] for a short 
proof. 


Theorem 2.1. Let f, A, and G be as above. Then f is arithmetically ex- 
ceptional if and only if the pair (A,G) is arithmetically exceptional. 


Remark 2.2. The proof of this theorem in [GMS97] also characterizes the 
prime ideals p modulo which the function f is bijective if it is arithmetically 
exceptional. Namely there is a bound C’ such that if |Ox/p| > C, then such 
an f is bijective modulo p if and only if (B,G) is exceptional, where B/G is 
the decomposition group of a prime of K lying above p. 


ULLER: Arithmetically exceptional functions and elliptic curves 183 


Branch cycle descriptions in geometric mo- 
nodromy groups 


et P! = P'(C) be the Riemann sphere over the complex numbers. We keep 
he notation from the previous subsection, and regard now f as a covering 
ap from P* to P*. Let n be the degree of f. There is a finite set B := 
by, b2,--- »0-} C FP? of elements with less than n preimages. We call these 
slements the branch points of f. 

Fix a base point b) € P’ \ B. and denote by 7 the fundamental group 
m(P’ \ B, bo). Then 7 acts transitively on the points of the fiber f~'(bo) by 
lifting of paths. Fix a numbering 1,2,...,n of this fiber. Thus we get a 
homomorphism 7 — Sym,,. By standard arguments (see [MM] or [V6196]), 
the image of 7 can be identified with the geometric monodromy group G 
defined above, thus we write G for this group too. 

This identification has a combinatorial consequence. Choose a standard 
homotopy basis of P' \ B as follows. Let +; be represented by paths which 
wind once around 8; clockwise, and around no other branch point, such that 
M72°°°Yr = 1. Then y,72,...,¥r-1 freely generate 7. 


bo 


Definition 3.1. Let o € Sym,. Then the index ind(c) is defined to be n 
Minus the number of cycles in o (where fixed points count as cycles too). 


Let o; be the image of y, in Sym,. If the points s;,...,8, in the fiber 
of b; have multiplicities e,,...,¢n, respectively, then o; has cycle lengths 
€1,...,€m. We say that a; has cycle type 1°°2% ---, where a; is the number 


of cycle lengths 7. Note that ind(o,;) =n — |f71(6;)| =n—_m. 
The Riemann-Hurwitz genus formula (or a more elementary argument 
using the derivative of f) gives the following basic relation: 


S— ind(a;) = 2(n—1) (1) 


a 
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We call the r-tuple (01, 02,... ,0,) a branch cycle description of G, an 
the unordered tuple (|o,|, |o2|,...,|o,|) the branching type of the branch 
cycle description. Of course the orders of the 0; do not specify the o,, but 
in most cases where G is fixed the branching types distinguish between the 
various possibilities for f. 
an iteration of the elementary braiding operations Q;,7 =1,... ,r—1, which 
send the tuple (91, g2,-+- Gr) to (9is+++ 5 Gi-15 Git1s Fig 1GiGit1 Gi42s--- 595), 

The elements 0; can also be seen as inertia group generators in a slight] 
more general context. Let K be a field of characteristic 0, and L/K(t) bea 
regular (i.e. A’ is algebraically closed in L) finite Galois extension with group 
G. For each ramified place P; : t + b; (or 1/t + 0 if b; = 00) let a; be 
a generator of an inertia group of a place of L lying above P;. There js a 
natural choice of o; up to conjugacy. Namely set y := t — 6; (or y := 1/t if 
b; = oo). There is a minimal integer e such that L embeds into the power 
series field Q((y1/*)). For such an embedding, let o; be the restriction to L of 
the automorphism of Q((y!/*)) which is the identity on the coefficients and 
maps y!/¢ to y'/* exp(2a/—1/e). The non-uniqueness of the embedding of 
L accounts for the fact that such o; are well-defined only up to conjugacy. 
We call the conjugacy class of 0; the distinguished conjugacy class associated 
to b;. 

Let E be a field between A(t) and L, and let ind(o;) refer to the permu- 
tation action of G on the conjugates of a primitive element of F/K(t). Then 
the genus g of E is given by 


Note that one can arbitrarily order the conjugacy classes of the o, Using 


S\ind(o,) = 2([E : K(t)] - 1+). (2) 


2 


The well-known deficiency of this purely algebraic setup is the following: 
Even for A’ = C there is no known algebraic proof that the o; can be chosen 
with o102::-0, = 1 and G= <oj,02,... ,0;>. 

Subsequently, we will call a place (or branch point) A-rational (or simply 
rational if K = Q) if 6; € KU {oo}. 

Sometimes one can read off from the branch cycle description whether 
the function f is defined over certain fields using the so called branch cycle 
argument, see [Vol96, 2.8] for a fuller version. Let G be a subgroup of A. 
We call an element z € G rational in A, if all powers 7” with m prime to 
|G| are conjugate to o in A. Also, we call a conjugacy class of G rational in 
A, if it consists of elements rational in A. 


Theorem 3.2. Let K be a field of characteristic 0, and L/K(t) be a fr 
nite Galois extension with group A. Setn = [LK : K(t)]. Then a € 
Gal(LK'/K(t)) permutes the branch points of LK /K(t) among themselves: 
Let ¢, be a primitive n-th root of 1, and m be an integer with AMC) = Cs 
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tC, be the distinguished conjugacy class of inertia generators associated to 
e place 6 of h(t). Then Cay) = a*(Cy)™, where a* ts the conjugation map 


In particular, if 6 is K-rational, then the class C, is rational in A. 


A typical application is the following. Suppose f € Q(X). Then the ab- 
solute Galois group Gal(Q/Q) permutes the branch points, but also preserves 
their cycle types. So if there is a branch point whose cycle type appears only 
once, then it must be rational. If the associated element o; is not rational in 
G, then necessarily A > G. 


4 Monodromy groups of arithmetically ex- 
ceptional functions 


The main group-theoretic result from {GMS97] is the following: 


Theorem 4.1. Let A be a primitive permutation group of degree n and G 
be a normal subgroup such that (A,G) is arithmetically exceptional. Further 
suppose that G has a generating system 01, 02,...,0, with o,02-:-o, = 1 and 
SY ind(o;) = 2(n—1). Then one of the following holds, where type means the 
ae type of the generating system. 


(I) n = p* for a prime p, AS N *GL.(p) with N =F is an affine group, 
and one of the following holds: 


(a) (i) n=p> 3, G ts cyelic of type (p,p); or 
(i) n= p> 5, G is dihedral of type (2,2,p); 01 
(wi) n=4, G=C, x C4 of type (2,2,2), A= Ag or Sq. 
(b) n=porn=p? for p odd, and 
(i) G 1s of type (2,2,2,2), G=N™»C), andn > 5; or 
(11) G is of type (2,3,6), G= N «Ce, andn=1 (mod 6); or 
(iat) G is of type (3,3,3), G= N «C3, andn =1 (mod 6); or 
(tv) G ts of type (2,4,4), G=N C4, andn=1 (mod 4). 
(c) (i) n = 11*, G is of type (2,3,8), G/N = GL.(3) and A = 
ALLO \0r 
(it) n = 5°, G is of type (2,3,10), G/N = S3 and A/N = S3x C4; 
or 
(itt) n = 5°, G ts of type (2,2,2,4), G/N is a Sylow 2-subgroup of 
the subgroup of index 2 in GL2(5), and A/G has order 3 or 
6; or 
(iv) n = 5?, G is of type (2,2,2,3), G/N = Ce x Cz and A/G is 


cyclic of order 2 or 4; or 
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(v) n = 3°, G is of type (2,4,6), (2,2,2,4), (2,2,2,6), or 
(2,2,2,2,2), G/N = Cy x C2 and A/G has order 2; or 

(vi) n = 24, G ts of type (2,4,5) or (2,2,2,4), G/N = Cs 10. 
and A/G has order 3 or 6. ss 


(II) (a) n = 28, G = PSL2(8) is of type (2,3,7), (2,3,9), or (252.958) 
and A = PIL,(8). : 


(6) n = 45, G = PSL2(9) ts of type (2,4,5), and either A = Myo, op 
A=PTL,(9). 


If we take a group G and a branch cycle description from this theo. 
rem, then Riemann’s existence theorem implies the existence of a rational] 
function over some number field A’ with G as geometric monodromy group, 
and branching given by the branch cycle description. However, two difficult 
arithmetic problems are left. First, it is not clear how small we may take 
KY, in particular whether we may take K = Q. This is the descent prob. 
lem encountered in the inverse Galois problem. Secondly, it is difficult to 
get a hold on the arithmetic monodromy group A once we have fixed K and 
F(X) € A(X). So after having done all the group-theoretic work yielding the 
above theorem, the question remains whether there are indeed arithmetically 
exceptional functions with the data in the theorem. 

The cases in (II) have been dealt with in [GMS97, Section 6] using variants 
of the rational rigidity theorem, and they are shown to appear over Q. The 
cases (I)(a)(i) and (I)(a)(ii) are very easy to deal with, and basically lead to 
cyclic polynomials X? and the Rédei functions (see [GMS97, Section 7]) in 
the first case, and the Dickson polynomials in the second case. Case (I)(a)(iii) 
appears also over the rationals, with the added feature that if A = Alt,, then 
Q can be any cyclic cubic extension of Q. 

The four infinite series in (1)(b) are intimately connected with isogenies 
of elliptic curves. A careful analysis is contained in [GMS97]. We want to 
remark that, as an alternative to the treatment in [GMS97], one can also 
use the branch cycle argument Theorem 3.2 to show that the cases (1)(b)(1i), 
(111), and (iv) do not occur over the rationals. Section 5 contains information 
concerning the first of these series, as we will need this setup to decide 4 
question of Thompson. 

As to (I)(c): Only (vi) has been shown to not occur at all over any numbet 
field K. Case (iv), which is the hardest, will be dealt with in Section 6. 

In the following we decide the existence over the rationals in al] othet 
cases, 


Cases (I)(c)(i) and (I)(c)(iii). In the first case the element of ordet 
8 is not rational in A, and in the second case the element of order 4 is not 
rational in A, so these cases do not occur by the branch cycle argument 
Theorem 3.2. 
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Case (I)(c)(ii). The given tuple (01, 02,03) is rigid in G. The elements 
g, and 2 are rational in G, and o3 is rational in A. The values of the 
irreducible characters of G at o3 generate the cyclotomic field A = Q(¢5), 
where ¢5 is a primitive 5th root of unity. The rational rigidity criterion 
thus gives a regular Galois extension L//(t) with group G and branching 
data given by the a,. The four conjugacy classes in G of elements of order 
10 are permuted transitively by Aut(G’), because A already permutes them 
transitively. Thus L is Galois over Q(t), see [V6196, Section 3.1.2]. One 
obtains that A = Gal(L/Q(t)). Let U be a subgroup of index 25 in A, and 
E the fixed field of U. Then A = GU, so E/Q(t) is regular. Further, the 
Riemann—-Hurwitz formula (2) shows that EF has genus 0, and is even rational 
because 02 has a unique fixed point, and so E has a rational place. Thus 
E = Q(x). Write t = f(x), and f is the desired rational function. 


Case (I)(c)(v). Here we have four different kinds of branching types. In 
the first one of type (2,4,6), we obtain that the associated triple of the o; is 
rigid and consists of elements which are rational in G. So the usual rational 
rigidity criterion shows that we cannot have A > G. 

Now suppose that we have branching type (2,2,2,4). Here indeed there 
are arithmetically exceptional functions with this data. It seems to be diffi- 
cult to exactly write them all down. Instead we give just one example, and 
show in Appendix A how we got it (and how to possibly get others). Set 
X(X* — 8X? + 12X7 — 48X — 28)? 

Oana 

and let F(X,Y) € Q[X, Y] be the numerator of (f(X) — f(Y))/(X — Y) as 
a reduced fraction. One verifies easily that f(X,1) is irreducible over the 
rationals (f(X,1) is irreducible even modulo 3), so F(X,Y) is irreducible 
over Q, hence the arithmetic monodromy group A of f is doubly transitive. 
On the other hand, one easily checks that F(X,Y) factors (into two factors 
of degree 4) over K := Q(V/2), so the geometric monodromy group G is 
not doubly transitive. There are only 3 doubly transitive groups of degree 9 
with a subgroup of index 2 which is not doubly transitive, namely AGL,(9), 
My, and AI'L,(9). From the branching over co and 0 we see already that 
G contains elements of cycle types 1'4? and 1124. Now let 8 be a root of 
22? + 88Z + 343. The numerator of f(X) — factors as (25X? + 16X? + 
38X?+100X +56 —28)(25.X° — 208X? —148.X2+4+702X +168X +112 —4)*, 
so f has two more branch points, and the inertia generators have cycle type 
1°23. None of the index 2 subgroups of AGL (9) and Mg has elements of this 
type, so we have A = ATL, (9), and the only not doubly transitive subgroup 
of A containing elements of the previous types is G = ADL,(9). 

From this function f we immediately get also a function corresponding 
to the branching type (2,2,2,2,2). Namely note that f(X*) = g(X)* for 
9 € Q(X). It is easy to verify that f and g have the same pairs of arithmetic 
and geometric monodromy group, and that the branching of g is as claimed. 


T= 
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Next suppose that we have branching type (2,2,2,6). We are going to 
show that this does not occur. To start with we need 


Lemma 4.2. Let Wh be a field of characteristic 0, € be an elliptic curve and 
pi — E be a K-rational morphism of finite degree of an algebraic curye C 
defined over Kv of genus 1 to €. Then C is an elliptic curve. 


Proof. Let J be the jacobian of C, and ® : C — J be the map such that 
® o o-! is the translation map T, on J by a point P, depending on VE 
Gal(A/K). Set UV := po @!:C  E. Without loss assume that Maps a 
fixed A’-rational point 0; to a K-rational point O¢, and that 0); and 0¢ are 
the zero elements of the respective additions on the elliptic curves. We get 


Wol=-v. 
So for Q a point on J, we get 
Y(Q) = W(Q+ P,) = (Q) 4+ W(P,). 


But @ = 0; shows W°(P,) = 0¢, hence VW = W? for all 7 € Gal(K'/K), so U 
and therefore also ® is defined over K. 0 


Now fix a branch cycle description (01, 02,03,04) of type (2, 2,2,6), and 
let A be the field of constants. Without loss let oo correspond to the element 
o4 of order 6. Two of the involutions, say 0, and o2, are conjugate in G, 
whereas o3 is not conjugate to them. So the branch point corresponding to 
o3 first has to be K-rational, but it is even Q-rational for otherwise (by 
the branch cycle argument) an element of Gal(Q/Q) would interchange both 
points corresponding to a, and a» with the one belonging to a3, because 01, 
@2, and o3 are conjugate in A, 

The group A has a subgroup U of index 4 with A = GU, so the fixed 
field of U in L is a regular extension of Q(t). The action of A on A/U is the 
dihedral group action, so there is precisely one group W between U and A 
of index 2 in A. Using the Riemann-Hurwitz genus formula (2) we compute 
that the fixed fields Ly and Lw of U and W both have genus 1. As 
is ramified in Ly /Q(t), we get that Lw = Q(t,z), where x? = q(t) for a 
cubic polynomial g € Q[T]. The zeros of q are the finite branch points of 
f, so in particular q has a rational root, so the elliptic curve X? = g(T) 
has a rational point of order 2. The inclusion Lw C Ly induces a rational 
morphism of a genus 1 curve with function field Ly to X? = q(T) of degree 
2. By the preceding lemma, Ly is thus the function field of an elliptic curve: 
Let (z,v) be a generic point on this curve with equation z? = q‘(v) for some 
cubic polynomial q’. Then Ly = Q(z,v). By the immediate part of [GMS97. 
Lemma 5.3], the isogeny of degree 2 gives t = R(v), where R € Q(V) has 
degree 2. But then we get the quadratic extension Q(v) of Q(t) inside Lu 
(and different from Ly, for instance because the genus is different), contrary 
to the fact that W is the unique group properly between U and A. 


ULLER: Arithmetically exceptional functions and elliptic curves 189 


Rational functions with branching type 
(2, 2, 2, 2) 


hroughout this section let A be a field of characteristic 0. We call a ra- 
onal function a (2,2,2,2)-function if it has exactly 4 branch points, and 
ranching type (2, 2,2,2). The following proposition gives a characterization 
{ jndecomposable (2, 2, 2,2)—functions of odd degree. The easy extension to 
the decomposable case (which we don’t need here) is left to the reader. 


Proposition 5.1. Let f € K(X) be a (2,2,2,2)-function of odd degree n 
which is indecomposable over kK. For a transcendental t let A := Gal( f(X)- 
t/K(t)) and G := Gal(f(X) — t/A(t)) be the arithmetic and geometric 
monodromy group, respectively. Then f has degree p™ with a prime p and 
me {1,2}, A=F? x H with H<GL,,(p), andG =F? x <-1>. 


Proof. We have G = <oj,02,03,04> with inertia generators o; of order 2, 
and 01020304 = 1. From o,0, = eae we see that the o; act by inversion 
on 0,02. For: # 7 set T;; = <a,0,>. We see as before that the o, act by 
inversion on 7; ,;. Note that 7). = 73,4, and so on. Also, a;02 commutes with 
g,03. From that we see that the normal subgroup N of G which is generated 
by the 7); is abelian and actually generated by 0,02 and 0,03. Also, N has 
index 2 in G, and therefore is transitive (there are at most 2 orbits, they 
have the same lengths, but the degree is odd.) Note that f indecomposable 
implies that A is primitive, so every non-trivial normal subgroup of A is 
transitive. Hence N is even normal in A, for otherwise N would embed into 
the direct product of the groups G/N® for a running through A, so N were 
a 2-group, contrary to odd degree. So N is a minimal and hence elementary 
abelian p-subgroup of A, which is generated by two elements. Thus A embeds 
naturally into the affine general linear group AGL,,(p) for m = 1 or 2, and 
the action of the a, on F” is of the form z+ —z +t; for t; € FY. (Note that 
by the Riemann—Hurwitz formula (1), each o, has exactly one fixed point.) 
In particular, G = Kee <—l>. O 


Remark 5.2. (A,G) is arithmetically exceptional if and only if H contains 
an element which has neither 1 nor —1 as eigenvalue. 


We now relate the arithmetic monodromy group A to the arithmetic of 
elliptic curves. For the applications in this paper we need only the case where 
One of the branch points is A-rational. A linear fractional change of f over 
K can move this point to infinity, but does not affect A. Thus suppose that 
S is one of the branch points of f. Also, we may and do assume that the 
unique simple point in the fiber f~'(oo) is also oo. Accordingly, write 
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with R,S € K(X), deg R =n, deg S = (n —1)/2. We may assume that R 
and S are monic. Let \; be the 3 finite branch points of f, and pu; be the 
simple point in the fiber of 4,. Set q,(X) = (X —A1)(X —A2)(X~—A3) € KIX 
and q,(X) = (X — p)(X — p2)(X — p3) € K LX]. We have ee 


a pane tale 
F(X) — A, = CX = Hi) SXF 
for @.(X) € A[X]. The roots of the monic polynomial Q1(X )Qo2(X )Q,/ X) 


are the roots of the monic numerator of the derivative of f, thus 


Q(X) Q(X )Q3(X) = f'(X)S(X)’, 
hence 
an(f(X)) = qu( X) F(X). 
From that we see that the morphism 
(x.y) (f(x), yf"(2)) 
induces a /’-rational isogeny of degree n of the elliptic curve E,, : ¥? = q,(X) 
to: By 2Y" = qy( X): 

The interesting question in this context is the structure of A, or more 
precisely, the field of constants of f. In [G@MS97, Proposition 5.4] we prove 
that the field of constants is generated over K by the X-coordinates of the 
finite points in the kernel of ¢, where ¢: E\, — E) is the associated isogeny. 


But these X—coordinates are just the roots of the polynomial S. Hence we 
get 


Proposition 5.3. Let f be a (2,2,2,2)-function as above, and K be the 
algebraic closure of K in a normal closure of K(x)/K(f(z)). Then K is the 
splitting field of S(X) over Kx. 


Note that A,/G, can be naturally identified with Gal(K/K). On the 
other hand Gal(K/K) acts on the elements of the kernel of ¢, which is an 
F,-space of dimension m, thus Gal(A’/K) maps into GLin(p). The induced 
action on the X—coordinates of these kernel elements is just the action of Ai 
on F”/<—1> (this follows from the proof of [GMS97, Proposition 5.4]), $0 
we get the following 


Corollary 5.4. Let f be a (2,2,2,2)—function as above. Then f is arith- 
metically exceptional if and only if Gal(S(X)/K) contains an element which 
does not fix a root of S(X). 


Remark 5.5. If the degree of f is a prime p (so m = 1 in our notation), then 
the condition on S is easily seen to be equivalent to $(X) having no root in 
K. For m = 2, a typical situation where the condition holds is when S(X) 
is irreducible over A’. Using Hilbert’s irreducibility theorem and a theore™ 
of Weber, one can easily construct for each odd p a function f of degree P” 
such that S is irreducible over A’, see [GMS97, Proposition 5.6]. 
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Let A be a number field. It is amusing to give a direct proof of the 
pmutation property of the functions f from the Corollary without going 
e detour over the group-theoretic equivalence of arithmetic exceptionality. 
Note that if the Galois group of S(X) contains an element which fixes no 
root, then S(X) has no root modulo infinitely many primes by Chebotarév’s 
density theorem. Thus we get a direct proof of the Corollary from 


proposition 5.6. Let Kk be a number field, and p be a prime of K, such 
that f, S, and the assoctated elliptic curves can be reduced modulo p. (That 
of course is possible for all but finitely many primes.) Let K be the residue 
field of the prime p. If S(x) modulo p has no root in K , then f mod p is 
bijective on K U {oo}. 


Proof. We work over the field K , and understand the coefficients of pre) 
and so on being reduced modulo p, so as being in K . The hypothesis gives 
that f(co) = oo, and f(a) # o fora € K. So we only need to show 
injectivity on A . By replacing f(X) with f(X + r) we may assume that 
qu( X) = XP + d,X + do. 

Suppose there is a,b € K , a # b, such that f(a) = f(b). Then there are 
u,v in a quadratic extension of K such that u? = q,(a) and v? = Gu(b). So 
by the previous development, the points (f(a), uf’(a)) and (f(6), vf'(b)) lie 
on the elliptic curve ¥Y* = q\(X). As f(a) = f(b), we have uf'(a) = +vf"(b), 
so by replacing possibly v by —v we may assume that uf’(a) = vf'(b). Also, 
wiv? € K ,souv€ K. (Note that if f'(a) = 0, then (f(a),0) is a point 
of order 2 on Y? = q)(X), so u = 0 as also (a,u) has order two by the odd 
degree of the isogeny induced by f.) Furthermore, the difference (a, u)—(b, v) 
lies in the kernel of the isogeny ¢. The X—coordinate of this difference hence 
is a root of S. The addition formula on elliptic curves gives: 


(a, u) = (b, v) = (a, u) TF (b, —v) 
(248) 
= (-a-—b+ .*), 


a—b 


but (utv)?=wtve+wek , so we get a root of S in K , contrary to 
the hypothesis. O 


5.1 Rational isogenies of degree 5 


If fe Q(X) is a function of branching type (2,2,2,2), and n = deg f is a 
Prime, then we get a rational isogeny of an elliptic curve over Q of degree 
m. According to a result of Mazur [Maz78, Theorem 1], this can happen 
Only for a few values of n. This is the reason for the short list of primes in 
Theorem 1.1(e). In Section 6 we have to look closer at the case n = 5. There 
are infinitely many elliptic curves admitting an isogeny of degree 5, however 
the possible j-invariants are restricted by the following: If ¢: E — E’ is 
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an isogeny of elliptic curves (everything defined over Q), then there is , 
absolutely irreducible polynomial (modular equation) F,(J, J’) € QL, J") of 
degree n + 1 in each variable, such that F,(j,j’) = 0, where j and j' ;. 
the j-invariant of E and E”, respectively (see [Sil94, Chapter ITT,§6]). Now 
suppose that n = 5. Then Fs; admits a rational parametrization as follows. 


see [Fri22, Viertes Kapitel]: 


(T? +107 +. 5)§ 


J => 
T 
72 t 3 
LOF 
Re eer with TT’ = 125. 


One can write T = R(J,J')/S(J, J") with R.S € QUJ,J']. So, as we have 
to have rational values for j and j’, the corresponding parameter is rational 
except for those pairs (7, 7’) for which R and S vanish. One computes that in 
these cases 7 = j’ € {1728, —32768, 287496, —884736}, and verifies directly 
(e.g. using the Maple package apecs for computations with elliptic curves 
[Con97]) that we cannot have rational isogenies of degree 5 in these cases. 
Conversely, if an elliptic curve has a j-invariant of the form el Cee for 
some non-zero rational 7, then one can compute that the 5th division poly- 
nomial has a factor of degree 2. So the curve has a 5-division point P whose 
X~—coordinate has degree at most 2 over Q. One easily checks that the group 
generated by P is Galois invariant, so dividing by this group gives a rational 
isogeny of degree 5. 
Summarizing, we get 


Proposition 5.7. Let j be the j-invariant of an elliptic curve over Q. Then 
the curve admits a rational isogeny of degree 5 if and only if 5 = ——— 
for some non-zero rational 7. 


In order to determine the field of constants, we also need 


Proposition 5.8. Let f(X) = R(X)/S(X)? be a (2,2,2,2)-function of de- 
gree 5 as above, qy(X), q,(X) be the associated cubic polynomials. Let j be 


‘ . + . ne 7 Py 3 
the j-invariant of Y* = q,(X). Then j = ee for some non-zero 


rational n, and the field of constants of f is Q( 5(n? + 22n + 125)). 


Proof. f induces an isogeny ¢ from Y? = q,(X) to ¥Y? = q\(X). Let @ be the 
dual isogeny. Then © := do¢ is the multiplication by 5 map on Y? = q,(-\): 
The kernel of @ of course is contained in the kernel of ®, so S(X) is a divisor 
of the 5-th division polynomial of Y* = q,(X). By the previous proposition, 


Gevont sy Let j’ be the j-invariant of Y? = q,(X). 


we may assume that j = 
Then j’ = + 10r'+ 5)? with 7’ = 125/n. Knowing 7’, we can compute thé 


discriminant of S(X). It is, up to a square factor, equal to 5(n? + 22n + 125): 
The result follows. C 
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Application to a question of Thompson 


s mentioned already in Section 4, it is usually a difficult problem to decide 
whether rational functions f(Z) € C(Z) with specific geometric monodromy 
groups (and branching data) are defined over certain small fields. In [Tho90] 
Thompson proposes the project to determine the cases where f is already 
defined over the rationals, and adds “ ... , an analysis which may require 
several years of hard work.” Specifically, he investigates the case of a certain 
degree 25 function, but has to leave undecided the question about small fields 
of definition. The purpose of this section is to give a different approach and 
to settle this question. We give the method in reasonably complete detail, 
as it works also in many other instances. As this function also appears in 
Theorem 4.1(1)(c)(iv) as a possible candidate of an arithmetically exceptional 
function, we give more details in order to also show existence of examples 
with the correct pair of arithmetic and geometric monodromy group. 


6.1 Thompson’s question, group—theoretic 
preparation 


Let W = APL(1, 25) be the affine semi-linear group acting on the elements 
of the finite field F,;. Thus W = T x(S ™ F), where T is the group of 
translations r+ «+t, S is the group of scalar multiplications z + az for 
non-zero a, and the group F' of order 2 is generated by the Frobenius map 
a 

For z a divisor of 24, denote by S(,) the subgroup of S$ of order 7, and set 
Wy = T x (Sq) « F). So Waa) = W. Set G = Wo), and note that W/G is 
cyclic of order 4. 

Let @ be an element of order 12 in F%;, and define elements o). 02, 03, 
04 € G as follows: 


a: rr Bz 
02:2 -2r+ 7 
04> ti By? 48 
04: BH 2? 
One immediately verifies that these 0; generate G. and that o,020304 = 1. 
(Here we write the action on F2; from the right.) The following table gives 
the cycle type and index of these elements: 
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| Cycle type | 113° jigte 15910 j5910 
f 


For natural action 


Thus, by (2) we see that (01, 02,03,04) is a genus 0 system, so there is 
a rational function f(Z) € C(Z) having G as geometric monodromy group 
and branch cycle description given by the o;. Thompson’s question [Tho99} 
is whether we can have f € Q(Z). We give an answer which also takes care 
of the different possibilities of the arithmetic monodromy group of f. 


Definition 6.1. Let K be a field, and spe € K(X) rational functions. We 
call f and f linearly equivalent over K, if there are linear fractional functions 


é,£2 € K(X) such that f(X) = &(f(Q(X))). 


Theorem 6.2. Let f(Z) € Q(Z) be a rational function with geometric mon- 
odromy group G and branching data as above. Let A := Gal(f(Z)—t|Q(t)) be 
the arithmetic monodromy group. Then there are, up to linear equivalence, 
exactly one such functions with A = G, and ezactly two with A/G = C». 

In the latter two cases, the field of constants is Q(V/5). 


The proof of this theorem is the subject of the following subsections. 


Remark 6.3. The normalizer of G in the symmetric group S25 is W. We 
have W/G = C4. Grouptheoretically, there is the third possibility that 
A/G = C4. We have not been able to prove existence in this case, though 
we have very strong evidence for that. 


Corollary 6.4. Up to linear equivalence, there are exactly two arithmetically 
exceptional functions belonging to Theorem 4.1(I)(c) (iv). 


6.2 Passing to a different rational function 


Let f be as in Theorem 6.2, and L be a splitting field of f(Z) —t over Q(t). 
Denote by Q the algebraic closure of Q in L. Then G = Gal(Z|Q(t)) and 
A := Gal(L|Q(t)), so A/G = Gal(Q|Q). 

Thompson’s idea [Tho90] was to look at the fixed field of T in CL over 
C(t). Then CL is an unramified Galois extension of this field (of genus 2) 
with Galois group Sig) x F. Here we rather work with fixed fields which have 
genus 0, and involve rational functions of type (2, 2, 2,2). 

Recall that A is one of the groups Woe), Waa), Way. If A = Wi), then 
A has, up to conjugation, the unique subgroup A') := W4i/3) of index 3. Set 


M ULLER: Arithmetically exceptional functions and elliptic curves 195 


(3) := AS) MG. Let L') be the fixed field of A) in L. As A = GA®). we 


3 


set that L) is a regular extension of Q(t) of degree 3. The action of the o; 
on the coset space G/G™) gives the following cycle types: 


aefala 


For action on G/G) 


By the Riemann-Hurwitz genus formula (2), we see that L') has genus 
0, furthermore L'°) is a rational field, because o3 has a unique fixed point on 
G/G), 

Let H be an F-stable F;-subspace of J’ = F2s. Then H is also stable 
under S(2) and S(4), so the semidirect product H x (Si:/3) x F’) is a subgroup 
of index 15 of W(;, for 2 = 6 or 2 = 12. 

Suppose |A/G| < 2, so A = Wi) for 2 = 6 or 12. Then let A") be the 
subgroup of index 15 in A constructed above, which is also a subgroup of 
index 5 of A®, As A = GA"), the fixed field L[5) of this subgroup is a 
regular extension of Q(t). The action of the 7; on G/G) gives the following 
cycle types: 


For action on G/G“) 


For the genus gris) of L''°) we obtain 
2(15 — 1+ gras)) = 10+6474+5, 


hence gris) = 0. As og has a unique fixed point on G/G"®), the field £5) 
has a rational place, so this field is rational. As the cycle types of the a; 
in this action are all different, we get that the branch points of L/Q(t) are 
tational. Write L“°) = Q(z), sot = g(z) for a rational function g(Z) € Q(Z). 
We get a decomposition g(Z) = g,(g2(Z)), with g; € Q(Z), deg g, = 3, and 
deg g. = 5, such that L') = Q(go(z)). 


6.3. Rationality question for |A/G| < 2 


We use the results from Section 5 to precisely pin down the function go of 
degree 5. As the branch points of g are rational, we may make the following 
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assumptions: oo corresponds to o;, and 0 = g;*(oo). Let 0 correspond tc 
ag3. The ramification information from above shows that the single point in 
the fiber gy'(0) is a branch point of g.. Assume that this branch point is oo. 
Without loss assume that 4/27 is the branch point corresponding to O4, and 
that g; has monic numerator and denominator. This gives 


(X=) 
xO 


Finally, let 1/y be the branch point corresponding to o2. The ramification 
information from above shows that gy'(1/u) consists of 3 different points, 
which are branch points of g.. What we get is that go is a (2, 2,2,2) function 
with branch points oo and the three roots of X? — u(X — 1)?. The elliptic 
curve associated to the branching data of g2 thus is (see Section 5) 


Yee ra wx a1): (3) 
The )-invariant of € is 
(mw ~ 6)° 
= 256——___.. 
? an Ap — 27 


On the other hand, as € has a rational isogeny of degree 5, its j—-invariant is 
of the form 


(7? + 10n + 5)8 
!} 


for some non-zero rational 7, see Proposition 5.7. This gives the algebraic 
curve relation 


Cs 256u(p — 6)°n = (4g — 27)(n? + 10n +. 5)%. 
The curve C is birationally equivalent to the elliptic curve 
E: V?=U*—7U* — 144U = U(U + 9)(U — 16). 


Using the MAPLE package [Hoe95] and some adhoc tricks, we get the fol- 
lowing birational correspondence, where ¢ := (n? + 10n +5)/(u — 6): 


27(864V + U4 — 36U% + 4320U) 
4(U — 36)U3 
36V + VU — 13U? + 108U 
er [72 
_ 9¢? — 36(n + 1)¢ + 144(n - 1) 
* 4(? + 4n — 1) 
9(2n + 7)C* — 36(m + 9)¢ — 144(3n +5) 
4(n? + 4n — 1) 


‘a 


Uh 


V = 
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emma 6.5. The field of constants of f is the same one as the field of 
constants of go. 


Proof. Of course, the field of constants of gj is contained in the field of 
constants of f. One verifies that the index of the core of A"@5) in Al) is 10 if 
A= Wo), and 20 if A = Wiz). So the degrees of the two fields of constants 
are the same. O 


We are now going to study the rational points on C via the rational points 
on €. 


Lemma 6.6. The finite rational points (ug, v9) on € are (0,0), (16,0), 
(—9,0), (—4, +20), and (36,4180). 


Proof. As 7 does not divide the discriminant of £, the rational torsion points 
of € map injectively to the F;-rational points of the reduction € modulo 
7, see [Si186, VII.3.1]. We compute that there are exactly 8 points modulo 
7 (including the one at infinity), so we are done once we know that the 
Mordell-Weil rank of € is 0. This however is well-known. Namely the linear 
transformation X = U/4—1, Y =(V —U)/8 maps € to the curve 


Y°pXY AY = Xoo Yea 0 ¥ = 10, 


which is C15 (one of the 8 curves with conductor 15) in the notation of 
[Cre97], and shown to have rank 0 there, confer [Cre97, page 110]. O 


Lemma 6.7. The rational points (j19, 7) on C are (135/128, —25/8), 
(—5/4, 25/2), (-675/8, —40), and (27/4, 0). 


Proof. As n? + 4y — 1 has no root in Q, the above transformation equations 
show that the only possible rational points (to, 0) which are not mapped to 
finite points on € have jp = 6. But that leads to n? + 10n +5 = 0, which 
has no rational solution. The finite points (uo, ¥o) on € which give points on 
C are those with ug # 0,36. (Those with ug = 0 or 36 give points on the 


projective completion of C.) C] 
We summarize: 


Proposition 6.8. Let C be the set of linear equivalence classes of rational 


functions g € Q(X) such that g(X) = gi(g2(X)) with 91592 € Q(X) and the 
following holds: 


(a) deg g; = 3, and g, has three rational branch points of branching type 
Oo 2, and 121. 


(b) deg g, = 5. Furthermore, g has, besides the three branch points of gy, 
also the different rational branch point 1/p, go is a (2,2, 2,2)-function 
with branch points gy*(1/) and a simple point gi (6), where 6 is a 
branch point of 9g, of type 112!. 


198 MULLER: Arithmetically exceptional functions and elliptic curves 


Let Q be the field of contants of g2. Then C has size 3, with g, and g, fs 
above, where jig = —675/8 gives Q = Q, and py = 135/128 or —5/4 gives 
Q= Q V5). 


Proof. The case (to, 70) = (27/4,0) is nonsense, whereas the other ratio. 
nal points on C give examples as stated. The claim about Q follows from 


Proposition 5.8 mo 
WwW 


6.4 Existence of f for |A/G| < 2 


We now use the functions g; from Proposition 6.8 in order to show that we 
get back the desired functions f whose existence we hypothetically assumed. 

Let Q be the field of constants of g2. It is clear that the geometric 
monodromy group of g is a transitive subgroup of the wreath product D5)D3, 
where Ds denotes the dihedral group of degree 5, and that in the case Q = Q 
the arithmetic monodromy group is a subgroup of the same wreath product, 
whereas in the case O= Q(V5) it is a subgroup of (Cs x Cy) 1 D3. Also, we 
know the cycle types of the branch cycle description of g. Using the computer 
algebra system GAP [$*95], one verifies that such a 4-tuple generates Woe) 
in its action on 15 points, and that this group G is selfnormalizing in Ds) Ds, 
and that the normalizer of G in (Cs x C4)0.D3 is Wi2). So the normalizers in 
these wreath products are just the expected ee monodromy groups 
A. Now let L be a splitting field of g(Z) — t over Q(t), and F be a fixed 
field of a subgroup U of A of index 25. One verifies that the o; induce the 
expected action on G/(GNU), also A = GU, so E has genus 0 and is regular 
over Q(t). Also, F = Q(y) is rational, because a; has a unique fixed point in 
this degree 25 action. Write t = f(y) for f(Y) € Q(Y), and f is the desired 


function. 


A Computation of the (2,2,2,4)-example 


Let K be the proposed field of constants. In G = Gal(L/K(t)) there is a 
subgroup U of index 6, such that the cycle types of a1, 02, 03, and a4 are 
1722, 1421 23, and 2'4!. Thus there is a rational function r(X) of degree 6 
over KY such that the fixed field of U is K(x) with r(z) = t. Let 6; be the 
branch point corresponding to o,. From the degree 9 action we see that 1 
and 64 are rational. Without loss assume that by = oo, the 4-fold point over 
by is oo, and the other one is 0. A consideration similar to the one in the 
(2,2, 6)—-case, utilizing the regular degree 4-extension over Q(t), shows that 
by and b3 are algebraically conjugate and generate KW’ over Q. Without loss 
assume by = —A, 63 = A, with A? € Q. If we make a further choice, namely 
assume that the double point above by is 1, then r is given by 


MBX? + 8BX? + 12K + 8BX + BX + 8B + 16)? 


r(X)=~2 
(36 + 248 + B2)2X? 


+ i, 
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where 2 € K. Furthermore, we compute 


(8 — 806° — 5048? — 17288 — 2160) 


ae 
; B4 + 4883 + 6482 + 17288 + 1296 


If we write @ = u+vX. with u,v € Q, and use that 5; has to be rational, 
we get a polynomial condition in u and v. This polynomial is the product 
of two genus 0 factors over Q, and it is easy to find rational points on them. 
One of them gives \ = V2 and 6, = 44/25. Of course, we have used only 
necessary conditions so far. Yet, nothing guarantees that the Galois closure 
of K(x)/A(t) is Galois over Q(t). However, in the specific case one can use 
an “almost—argument” to verify that. Namely express the coefficients of r 
in terms of \ = 2, and denote by 7 the function where we replace \ by 
—\. So the numerator of (r(X) — t)(F(X) — t) is in Q(t)[X] of degree 12, 
and the Galois group should have size |A| = 144. One now checks that using 
the computer algebra system KASH [DFK*96] for various specializations of 
t. So we have a good candidate for the location of the branch points in 
order to compute the function f € Q(X) of degree 9. The branching data 
gives polynomial equations for the coefficients of f. The resulting system 
is too big to be handled and solved by the usual Groebner basis packages. 
Instead, we use a MAPLE package by Raphael Nauheim [Nau95], which 
computes the solutions modulo a fixed prime, and lift them to p-adic numbers 
for sufficiently many digits in order to see periodicities and then guess the 
rational numbers. Once one has such a function, it is routine to verify that 
it has the desired properties, as we did in Section 4. 
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Tangential base points and Eisenstein power series 


HIROAKI NAKAMURA 


In this note, we discuss a Galois theoretic topic where the two subjects of 
the title intersect. Three co-related sections will be arranged as follows. In 
Part I, we review basic notion of tangential base points for etale fundamental] 
groups of schemes of characteristic zero. Then, in Part II, we introduce 
‘Eisenstein power series’ as a main factor of the Galois representation “of 
Gassner-Magnus type” arising from an affine elliptic curve with ‘Weierstrass 
tangential base point’. Part IIT is devoted to examining the Eisenstein power 
series in the case of the Tate elliptic curve over the formal power series ring 
Qi[q]] (introduced in Roquette [R], Deligne-Rapoport [DR]). We deduce then 
a certain explicit relation (Th.3.5) between such Eisenstein power series and 
Thara’s Jacobi-sum power series [I1]. 


I 


In [GR], A.Grothendieck invented Galois theory for general connected 
schemes. It is based on axiomatic characterization of a “Galois category” 
which models on the category Rev(X) of all finite etale covers of a scheme 
X. In this theory, the role of a base point of 7, is played by a certain 
“Galois functor” Rev(X) — {finite sets} which axiomatizes the functor of 
taking fibre sets over a “base point” for all covers in Rev(X). Then, a chain 
between two “base points” is by definition an invertible natural transfor- 
mation between such Galois functors. In particular, the fundamental group 
based at a Galois functor ® is the functorial automorphism group Aut(9), 
or equivalently, the automorphism group of the coherent sequence of finite 
sets {®(Y )}y erev(x) (with maps induced from those in Rev(X)) topolo- 
gized naturally as a profinite group. 

Recently, the notion of “base point at infinity” seems to be calling ce! 
tain attentions of Galois-theorists, according as fascinating problems of 
Grothendieck [G] are known (cf. V.G.Drinfeld [Dr], L.Schneps&P.Lochak 
[SL].) This notion was founded rigorously by P.Deligne [De] as “tangential 
base point” for more general 71-theory of motives (including Betti, de Rham™: 
etale realizations etc.) Still in the original Galois context, G.Anderson and 
Y Ihara [Ai] initiated effective use of Puiseux power series to represent such 
a base point, which has led to a number of practical applications in Galois 
Teichmuller theory ({IM], [IN], [Ma], [N2,3],...) Inspired from these works: 
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n this paper, we shall employ the following simple definition of a tangential 
hase point. 

1.1) Definition. Let X be a connected scheme, and k((t)) be the field of 
Laurent power series in ¢ over a field k of characteristic zero. A k-rational 
tangential base point on X is, by definition, a morphism v' : Spec k((t)) + 
X. This amounts to giving a scheme-theoretic point z of X together with 
an embedding of the residue field of x into the field k((t)). (The point x 
may or maynot be a k-rational point of X; see examples below.) 


A basic motivation to introduce the above definition is that it has been 
often the case that a certain role of a “base point at infinity” can be played 
by a generic point of a 1-dimensional subscheme with specified 1-parameter 
“4”. Let us explain how such a tangential base point 7 could work in the 
study of Galois representations in fundamental groups. Following Anderson- 
Thara [Al], we fix an algebraically closed overfield Q = k{{t}}, the field of 
Puiseux power series in the symbols “t!/"” with (1!/™" ym = ¢1/” (m,n EN), 
which is the union of the Laurent power series fields k((t!/”)) for n € N. 
Given such a ¥ (and Qs), for each cover Y € Rev(X), we may associate 
the set of its Q;-valued points Y(Q;z). This is a finite set as the fibre of the 
finite etale morphism Y — X over the geometric point von X. Noticing also 
that every morphism Y’ — Y in Rev(X) induces a natural map Y’(Q;7) > 
Y(Qz), we get a coherent sequence of finite sets {Y(Qz)} indexed by the 
_ objects Y € Rev(X), or equivalently, a fibre functor &; : Rev(X) — {finite 
sets} (Y ++ O5(Y) = Y(Qz)). 

Now, the absolute Galois group G, = Gal(k/k) acts on Qy by the co- 
efficientwise transformation of power series i, Ggt® + Daeg F(Ga)t®, 
hence induces an automorphism of the sequence {¥ (Qs) }yerevix) coher- 
ently. Thus, we obtain a natural homomorphism 


87: Gp m1(X,v) = Aut(®;). 


When X is defined to be geometrically connected over k, then sz gives a 
splitting of the canonical exact sequence 


2) a eS OG, _s 4, 


By conjugation, sz defines a Galois representation 
yo : G, — Aut(m (X_,70)) 
which lifts the exterior Galois representation 
yp: Gy — Out(m(Xq, v)) 


induced from the exact sequence (1.2). Generally speaking, the group- 
theoretic character of y is independent of the choice of base points, while 
that of yz is dependent on 7. 
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Example 0. Any k-rational point x € X(k) gives automatically a k-rationa] 
tangential base point via k — k((t)). 


Example 1. Let X = Po — {0,1, co} be the projective t-line over Q minus 
the three points t = 0,1,00. Then, the residue field of the generic point és 
of X can be identified with the rational function field Q(t). The obvious 
embedding Q(t) —+ Q((t)) determines a morphism 


Spec Q((t)) + X= P; a {0, Recon @ 


whose target lies on the generic point x of X. We call the tangential base 
point given by this morphism the standard tangential base point on X, ang 
denote it by 01. Note that the notion of O01 depends on the choice of the 
normalized coordinate t of P’ setting the 3 punctures to be t = 0. 1,0, 
Now, we have a natural compactification P' of X, with respect to which 
the above 01 can be extended to the morphism Spec Q|[t]] + P}. This 
means that the “point” determined by 01 is not only the generic point x of 
X but also a uniformizer of the (completed) local ring at t = 0 on P}, ic., 
a 1-dimensional tangent vector (consisting of ‘direction’ + ‘speed’) starting 
from t = 0. (If we change normalization (e.g., scale) of t € Q((t)) relative to 
the standard coordinate ¢ of P!, the represented vector will differ from 01. 
In a few contexts where X is regarded as the elliptic modular curve of level 
2, another tangential base point «201” plays a crucial role, as pointed 


out in [N2-3].) According to this realization, we usually picture 01 as a 
unit tangent vector rooted at 0 towards 1. Fix an embedding Q — C so 
that the geometric fundamental group 71 (XG, 01) may be identified with the 
profinite completion of its natural Betti correspondent. Then, ™1(XG, 01) is 


a free profinite group F» freely generated by the standard loops z. y running 
around the punctures 0,1 respectively. 


It is known that the Galois representation y5; embeds Gg into AutF» in 
such a way that 


(1.3) o(z) = 2), o(y) = fa(2,y) 1 y foley) (¢ € Gg) 


with fo(z,y) € [F2, 2] (G.V.Belyi), where y : Gg — Z* denotes the 
cyclotomic character. The pro-word f, is uniquely determined by the above 
formula, and plays a central role in the Grothendieck-Teichmiller theory. 


Example 3. (Ihara-Matsumoto [IM]) Let Xp, be the affine n-space A} minus 
the discriminant locus D, whose geometric fundamental group is isomo!- 
phic to the profinite Artin braid group B, generated by 7,...,T-1 with 
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mations 77; = 7;7; (|t—j| > 1), it417% = Roi (1 Si < n—1). The 
-overing space Y,, corresponding to the pure braid group P, C B, can be 
aturally regarded as {(t1,...,tn) € AS | ti At; (¢ F 7)}. Define a tangen- 
jal base point uv” : Spec Q((t)) — Yn viate (aed ,t”), and let v be the 
rojection image of v’ on X,. Then, it turns out that ys : Gg — AutB,, 
provides the Galois representation of the form: 


i o(m) = Te 


1.4) . 
Lott) = folyi Ph fay?) Wsisn=1), 

where ¥j = T%-1°°''71°71-::Tj~1. This Galois action is compatible with 
Drinfeld’s formula discovered in the context of quasi-triangular, quasi-Hopf 


algebras ([Dr]). 


Example 4. Ifaspace X is given as a modular variety parametrizing certain 
types of objects, then to construct a tangential base point v : Spec Q((t)) 
X is equivalent to constructing such an object defined over Q((t)). To do 
this, sometimes, formal patching method turns out to be useful in smoothing 
aspecially degenerate object Yo/Q over Q|[t]] whose generic fibre Y,, /Q((t)) 
defines ¥ with desired properties. We refer to [IN], [N2-3] for some of such 
examples of tangential base points constructed in the moduli spaces My n 
of the marked smooth curves. The method will also produce a “coalescing 
tangential base point” v(g) on the Hurwitz moduli space H(G;Ci,...,C,) 
associated to a Nielsen class g € Ni(G,C). Indeed, for any given transitive 
permutation group G C S, and for any generator system g = (gj,...,gr) 
with gi---g, = 1 (g; lying in a conjugacy class C; C G), define r — 2 
Btiples {(z;,y;,2:) | i = 1,...,7 — 2} by setting 2; = 91 +--+ 9:3, 4% = 9341, 
4 = git2°'-gr. Then, since z;y;2z; = 1, each (2;, y;,2;) (regarded as a 
branch cycle datum) defines a (not necessarily connected) branched cover 
Y; — P!' ramified only over {0, 1,co}. One obtains then an admissible 
cover Y; = U;Y;/ ~ over a linear chain P of P},., such that ~ identifies 
the branch points on Y; and Y;41 according as the cycle orbits by beets 
a.) in {1,...,n}. One expects that suitable techniques for smoothing 
Y = P (cf. [HS], [W]) should yield a good v(g) on the Hurwitz moduli space, 
generalizing a prototype example given in [N2]. We hope to investigate some 
aspects of this construction in a circle of ideas of inverse Galois problems 


([Fr}), 


I] 


In {{1], Y Ihara discovered deep aspects of arithmetic fundamental groups 
by interpolating complex multiplications of Fermat jacobians encoded in 
™(P! — {0,1,00}). Among other things, he introduced new l-adic power 
Series F, € Zi[[T1,T2]] (¢ € Go) whose special values at roots of unity 
Tecover Jacobi sum grossencharacters. He also conjectured the explicit forms 
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of Galois characters appearing in the coefficients of F, in terms of Soule’s a 
adic cyclotomic elements, which was settled by Anderson [A], Coleman [C). 
{hara-Kaneko-Yukinari [IK Y] (see 3.6 below). Ihara [I2] also developed an 
l-adic theory of Fox’s free differential calculus to control this power series F, 
in the framework of combinatorial group theory. This enables one to relate 
F, and f, of the previous section in a very simple way. We shall employ this 
treatment also here in a slightly more general setting applicable to higher 
genus curves. 

Let X be a smooth projective curve of genus g over a field k of charac. 
teristic 0, S a non-empty closed subset of X with geometric cardinality n. 
and let C = X —S be the affine complement curve. Practically, we shall be 
concerned with the “pure affine hyperbolic cases” of (g,n) = (g,1) or (0, n) 
(g > 1, n > 3), where the geometric fundamental group is a nonabelian free 
profinite group with its 1-st homology group being pure of weight —1 or ~9 
respectively. 

Fix a rational prime p, and pick a k-rational tangential base point v on 
C’. The Galois group Gx acts on 7(C%, v) and hence on its maximal pro-p 
quotient 7. We shall write this action as pe Gy — Aut(m). Note that x 
is a free pro-p group of rank r := 2g +n—1 and that its abelianization H 
is canonically identified with the p-adic etale homology group H (Cj, Z,) 
which is a free Z,-module of rank r. Our interest will be concentrated on 
the kernel part of the composition map: 


. 2) 
(2:1) BEE Ge. Au) CLA), 


The fixed field of the kernel of p\”) (resp. the kernel subgroup ker(Aut(7) > 
GL(#)) of Aut(a)) will be denoted a ln (resp. Aut)7). 


In order to analyze the restriction | p; 4 lay, ,, closely, we construct a certain 
combinatorial (anti-)representation 


(2.2) A: Autim > GL,(Z,[[H]]) 


in analogy with the Gassner-Magnus representation in combinatorial group 
theory (cf. Ihara [I2], see also [Bi], [Mo] for topological aspects). Here: 
Z»([H]} is the abelianization of the complete group algebra Zp[[x]] which 
is by definition the projective limit of the finite group rings (Z/p"Z) )[/N] 
over the open normal subgroups N C 7 and n€N. The Gassner-Magnus 
representation is a basic device to look at operations on the maximal met@- 
abelian quotient of (cf. 2.7 below). To give its precise definition, we shall 
first introduce some terminology of free differential calculus. 

If {z,,...,2,} is a free generator system of 7, then, as shown by Lazard 
[La], Z,[[w]] can be regarded as the ring of formal power series in nol 
soa variables t; := 2; — 1 (¢ = 1,...,r) over Z. Each element 
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Z,|[7]] is then written in the form 


\=)oay-w (ay € Zp), 


Ww 


here w runs over all finite words in {t;,...,t,} with non-negative expo- 
ents including the unity 1. We call a, the constant term of \ and denote it 
Ny e(\). Classifying the other terms ay, -w (w # 1) of \ according to the right 
most letters, we may write uniquely A = e{\) + am Ait; (Ai € Zp[[7]]). 
his A; is by definition the 2-th free differential of A, and will be denoted 


“. Or 
2.3) A= e(4) + D5 —(2 
i=l ; 


n the following, we use capital letters X,;,7; to designate the images of 
i,t; € Zp[[7]] in the abelianization Z,[[H]] (i = 1,2). Obviously, it follows 
hat 

Z(H = pl... (=X). 


For a general element \ € Z,[[7]], we write \*° for its image in Z,[[H]]. 


(2.4) Definition. For a € Aut;7, define its Gassner-Magnus matrix by 


2.5) Proposition. The mapping A: Aut;7 > GL,(Z,[[A]]) (a+ Aq) ts 
an anti-representation, t.€., Aga = Agi Aa (a, a’ € Aut 7). 


Proof. From direct computation, we have 


a Jal (x; Ja(re)) 
aa'(x;) sania ue Day eee) . = (x; — 1) 


The result follows at once from the fact that a acts trivially on Zp ([H]]. O 


(2.6) We note that the above construction of & depends on the choice of 
the free basis (21,...,2,) of w. In order to see its dependency on the choice, 
it would be convenient to introduce, more primitively, the Magnus matrices 


a for a € Aut(7) by 
Oa(x; 
ae 1<i,j<r 
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with entries in the non-commutative algebra Zp ([7]] satisfying the anti-]_ 
cocycle property: Y%ag = a(Ag)- A, (a, HE Aut(7)). Then, one can show 
that if the free basis (4,,...,2,) is replaced by another basis (z4,-..,24) 
then the respective Mas nus: matrices 2%,, 2/, are related by “Jacobian iss 
trices” as follows: 


Oa ote! a) : ae a € Aut(7)), 
7" Ozh... a = “\ (zi... aaah Rf (7r)) 
Since a € Aut,7 acts trivially on Z,[[H]], the above implies that the 
Gassner-Magnus matrix 2) w.r.t. (a},...,2/) is just the conjugation of 
%, by the Jacobian matrix (Bree) ab 


r )/ 

The usefulness of the anti-l-cocycle representation of Aut(z) through 
Magnus matrices was shown by Anderson-Ihara [AI] Part 2 in their close 
study of Galois representations in 7;(P! — {n points}) (where a more ‘geo- 
metric’ variant was employed). Independently, Morita [Mo] presented effec- 
tive applications of Magnus matrices in his theory of “traces” of topological 
surface mapping classes. 


(2.7) We shall now briefly explain how the Gassner-Magnus representa- 
tion looks at the meta-abelian quotient of 7. Let 2” be the double commu- 
tator subgroup of 7, i.e., 7” = [n’, 2’] where x’ = [x,7]. Then, the pro-p 
version of Blanchfield-Lyndon theorem (cf. Brumer [Br] (5.2.2), Thara [I2] 


Th.2.2) tells us an exact sequence of Z,[[H]|-modules: 


(BL,) 0a’ /a" % ZA] 2,29 T(t) ~ 1H) = 0, 


where /(*) denotes the augmentation ideal of Z,[[*]], and the maps 0, 6 are 
defined by O(a mod 7”) = 1@(a—1), 6(b@c) = b-c*°. Since I(7) is known 
to be the free Z,[[m]]-module of rank r with basis 2; — 1 (¢=1,..., r), one 
can identify ie. middle module with @;_, Z,[[H]] @ (x; — 1) ~ 7,, es 
so that 


Ne Ja PF 9s 
Ha) = (GI ait (Ge)*) € Blt 


Each automorphism a € Aut(7) acts on the modules of (BL,) compatibly, 
especially on the middle one by a(b @c) = a(b) @ a(c). In particular, 
a € Aut;(m) acts on it Z,[[H]]-linearly with matrix representation given 
by the (transpose of) Gassner-Magnus matrix %,. Noticing that 2/7” 1 
embedded there by @, one sees at least that the representation of Aut,(7) 
in m’/7" should be analyzed well by the Gassner-Magnus matrices. 

Returning to the situation of Galois representation pl?) ; Gy — Aut(7): 
our main concern thus turns to look at the composition with the Gassne!- 
Magnus representation: 


= (p) 
Ar = Aov! 


Ghiri : Gia.) == GL, (Z,[[]}). 
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the remainder of this section, we review known results on the most basic 
¢wo cases of (g,n) = (0,3), (1,1). The former is Ihara’s original case C = 
p!—{0,1,0o} ([11.12]), and the latter case is for C = an elliptic curve minus 
one point, which was introduced/studied by Bloch [Bl], Tsunogai [T] and 
the author [N1]. 


Base 1: C = P! — {0,1, 00}, @= 01, 21 = 2, eo = y. 
In this case, it follows from computations that 


- l 0 
Maile) = ( Gate < 1)) * (1 + sit (x - 1)) *) 


for ¢ € GQ(u,0)- (Note that Q(1) = Q(uzpx ) now). The power series 


— Of, = 
Py lists) = det A(0) = 1+ Dz (®o - 1) 
2 


is called the universal power series for Jacobi sums, or Ihara’s power series 
({11,2], [fc], [Ma]). In fact, Ihara showed that the mappings g > Fo(Gon — 
qe — 1) (1 < a,b < p”) represent Jacobi sum grossencharacters over 
Q(p»), and also investigated the p-adic local behaviors of the coefficient 
characters. As in {I1], F, can be defined for all o € Gg, but in the present 
paper, we content ourselves with treating it only over Q(u,~). By the above 
definition and (2.6), we see that F, in this range is determined only by the 
abelianization of the free basis (71,22) of 7. In view of the above (BL), 
specialized to this case, the module 7’/m” turns out to be a free Z,[[H]]- 
module of rank 1 generated by the class of [41,22] = £1 Loe, 25" mod 7”. 
The image of @ is generated by O(([x1,22]) = (—T2,71) € Z,[[H]]®?, and 
from this follows that A>(c) (hence ysz(c)) acts on Im(d) = a’/n” by 
multiplication by F,(7),72). This was in fact Thara’s original definition of 
F, in {11}. 


Case 2: C: y? = 443 — gox — 93 (g2,93 € k), A = 93 — 2795 £ 0. 

In this case, we will take suitable generators z1, 22, z of m with [z,, z2]z = 
1 so that z generates an inertia subgroup over the missed infinity point 
Oe€X. For a tangential base point, we take t : Spec k((t)) — C defined by 
t := —2zx/y and call it the Weierstrass base point. The fixed field &(1) of the 
kernel of Gy-action on H = m/[m, 7] is the field generated by all coordinates 
of the p-power division points of F over k. As shown in [N1] §6, there exists 
a unique power series €,(7), 72) € Zp[[T), T9]] such that 


TT, —-T? ) 


for alla € G1). It is easy to see that €, depends only on the abelianization 
image (Z1, £2) of the free basis (x1, x2) of 7. We shall call €, the Eisenstein 
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power series associated to the Weierstrass equation y? = 423 — 92% ~ g, 
and the basis (£1,022) of H. As in Case 1, again, the image of a! /x! in 
Zp [[H]]}®* via 6 of (BL,) is the free Z»|[H]]-module of rank 1 generated 
by O(z) = (T2,—-T1), but this time the action of A, (a € Auti7) on this 
image is trivial. This means that €, (o € G1)) should be understood as 
an invariant of the ‘unipotent’ action of y(c) on the extension of (BLY, 
In fact, using Bloch’s construction described in [T],[N1], one can show more 
explicitly that 


ya(o)(1 @ (x; — 1)) = 18 (aj — 1) + €&(T1,T2) T; - A(z) (¢ = 1,2) 
holds for ¢ € Gy1) in Zp[[A] Oz, yay T(z). 


(2.8) Remark. In [N1], we employed a special section s : Gray > 7(C) 
characterized by a certain group theoretical property instead of that induced 
from the above te. The power series a, given in loc.cit. is the same as E 
except that it misses constant term. If p > 5, the constant term of Es is 
tpa(c) where pa : G1) — Zp is the Kummer character defined by the 
p-power roots of A. 


In the next part III, we will show that €, for the Tate elliptic curve over 
Q((q)) degenerates to a “logarithmic partial derivative” of Ihara’s power 
series F,. 


II 


In this section, we shall examine the Eisenstein power series arising from 
the Tate curve T =“G,,/q@” over the rational power series ring Ql{q]] in 
one variable q. The affine equation defining T (minus the origin O) is 


(3.1) y + ay = 2? + a4(q)e + a6(q), 


where a4(q), a6(q) € Q\[q]] are given by 


aig) B5a(ah;. alg = besa) aT), 


es) 
1Z 


n* n 
s2(a) = > oe(njo" = Fe ae (k > 1). 


n>1 n>1 


The equation modulo q is y* + zy = x3, hence T has a split multiplicative 


reduction. Indeed, the Q((q))-rational points of T — {O} are uniformized 
by we Q((”)* \ q@ through the formulae: 


z(u,q) = D> #2 Bail g), via) = Yo ts + sla) 


l—q"u)? 
ay 7 néZ 
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nd the special fibre T, at g = 0 may be regarded as the nodal projective u- 
ne with two points u = 0,00 identified (cf. [Si] V §3.) The completed local 
gighborhood at u = 1 in T can be identified as Spec Q[[¢, u — 1]] whose 
eneric fibre is the spectrum of Q[[q, u — 1]] @gqgy Q((4)), the ring of formal 
jower series in u—1 with bounded coefficients from Q((q)). In the latter ring, 
ve may arrange the mapping q +> t', u—1++t’ to define a tangential base 
noint valued in Q((t’)) on the generic fibre 7, /Q((q)) of T minus the origin 
C _ We write this tangential base point as € : Spec Q((t’)) — T, — {O,} 
and call it the Tate base point. In the following, we shall look at the Galois 
representation yz: Gg — Aut m1(T; \O), where T; \ O denotes the generic 
geometric fibre of T — {O}. ; 

First, let us connect the above Q-rational base point t with the Q((q))- 
tational Weiserstrass base point ~ on the generic elliptic curve 7, (intro- 
duced in the previous section). Indeed, we see that these two base points 
give essentially the same Galois action on 7(J \ O) as follows. First, let 
us apply the change of variables “X = 2+, Y = 2+ 2y” to (3.1) to get 
the equation of Weierstrass form 


(3.2) ¥? =4X° — go(q)X — ga(q). 


where 


B . 
g2(q) = 20(- =. +S = o3(n)q"), 


n>1 


| (Bz = —1/30, Bs = 1/42 are the Bernoulli numbers. ) Then, as explained 
in the previous section, the Weierstrass base point on T, — Oy is defined 
as a tangential basepoint valued in Q((q))((t)) by putting t = —2X/Y. Our 
“claim here is that the Galois representation yz : Gg — Aut (Tj \ O) is 
essentially the same as the composite of yg : Goy(g)) = Aut 7(Tqz \O) with 
the map Go — G, Q((q))> Where the last map is the one obtained from the 
coefficientwise Gg-action on the Puiseux power series in Q(( Q((q)) Ot {a}. 
Indeed, since x and y can be written respectively in the forms (w—1)~?(1+ 
Dn Om(t — 1)™), —(u — 1)73(1 + Bm (u — 1)™) with am, Pm € Q[[al] 
(cf. (si) V i the coefficientwise Gg-actions on the two rings Q{{q'/* ee"), 
Ql 1/N (uw — 1)'/N]] are compatible with their natural identification via 
t= -2X/Y = = u—1 mod™ (1+ (u—1)Q[f[¢,u— 1]]). From this, the above 
telation of yg with »; follows. 
Now, fix a rational prime p, and consider the p-adic Tate module H = 
lim, T;[p™]. As is well-known, in the Tate curve case, H is an extension of 
Z, by Z,(1). But in our case, we may split the extension in a natural way as 
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follows. In fact, by the Tate uniformization by “G,,” of T;, the P™-division 
points T;[p™”] may be identified with the subset {Com q?/P™ LOS abe p™) 
of Q{{q}}* with natural Ggy,))-action. (Here C)m is a primitive p™ +4} 


roots of unity; we select those so that 7, 2 = Cpn-m (0 << m< rn) once and 


for all.) Thus, we can take generators 21,22 of H as projective sequences 
{q!/P"}, {Gm} respectively to obtain a splitting H = Zpt, ® Zp(1)zo. 
Let us then consider the maximal pro-p quotient 7 of 7,(T;\O, f) and the 


©) - Gg — Aut(m). Then, with respect tp | 


the above basis (£1, 22) of H, we have the Gassner-Magnus representation 


A>: Gg — GL2(Z,|[H]]), which yields the (Tate-)Eisenstein power series 
EX(T;,T2) € Zp[[H]] (o € Gocypeo)) defined by 


associated Galois representation y 


= 7 —~T! 
Axo) =1e.+ & oi 9 


(3.3) Theorem. Let U; = log(1+T7;) (¢= 1,2). Then, in Qp[[U1, Uo]], we 


have 


5 f a Xm41(C) Us" ‘ 

€5(T), 72) —_ S a a (o a GQ ,00 )). 
m>2 
even 


Here viz 2 GQ(up2) — Z,(m) ts the m-th Soule character defined by the 
properties: 


\ 5r(o—1) 


I] a-o)°"" = (vag). 
1<a<p” 
p{a 


Proof. The statement follows from a more general formula given in (N1] 
which states that the coefficient «;;() of UjU3/(1—I'*/ )ilj! ((i, 7) & (0,0) 
is determined by the following Kummer properties: 


/ \ sr (o—1) 


ei a =e (Un 241), 


where, for 0.<,¢,b6 <.N =p", 


y 
re 
bo 


N) 6 B2(2,) -66(4—-1) a a 8s 
O5n = PPM NN” 11 — CR) [[ at ech) — a" 8 Cy’) 
n>1 
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ere B2(T) = T’?~T +3 is the second Bernoulli polynomial. Observing that 
e coefficientwise GQ 900) action on the p-power roots of gir" is nontrivial 
nly when a = 0, and noticing that 0* = 1 only when i = 0, we see that 
- occurs nontrivially only when 7 = 0, in which case it is aqiial LO. XFL is 
Phe constant term turns out to vanish according to Remark(2.8) applied to 


A(g)=@nsid-a")". 0 
The above result (3.3) may also be deduced by an alternative method 
yelating £5 explicitly with Ihara’s power series F,. We begin with 


(3.4) Theorem. One can take surtable generators 21, 20,2 of (Ta \ O, €) 
with [z1,22]z = 1 such that (x1, 22) lofts (21,22) above and that the Galors 
representation pr : Gg — Aut (Ty \ O,t) is expressed by the following 
formulae in terms of (x(¢), fo) of §1 Example 1: 


1—xle) = = = 
vy ree : fo(ziz9z}', oa) - 


to + fo(a), z)a%” f,(25', 2) 


zoe 2X(9) 


Proof. This assertion is essentially [N3] Cor.(4.5), except that the choice 
of generators differs from loc.cit. We first consider ‘Gy,/q"™’ (n > 2) over 
Q\[q]] and realize the fundamental group of generic geometric fibre minus 
sections (one for each component) as a Van-Kampen composite of copies 
m(i) (« € Z/nZ) of m(P! — {0,1,00}). Identifying r(z) = (0;,1;,00; | 
0;1;00; = 1), we compute the composite as the amalgamated product of 
the 7(z)’s and (e) over the relations 007! = 0:41 (0 <i<n—1), &p-1 = 
e0ge—!. Setting then standard generators xj = e, 2 = UR 2; =" bey 
(1 <i <n), we get the relation [z;,29]z1--+ 2) = 1. Then, [N3] Th.(3.15) 
computes the limit Galois representation yz on these generators in terms 
of the parameters x(c), fo. The desired Galois representation follows from 
this computation after reducing z} = z, zg = -:: = Zn = 1 (and checking its 
subtle independence of n). O 


Using the above, we shall sonigute Et directly from the definition. Note 
—1 divided by —T,T2 for  € Gavu,e): 


dyp\? sexes) 


that it suffices to look at - 
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First, we compute 


Op? (o)(z2) | Ofpwafz? yee 
ee -l|l= 7 lee total? 


Tie 1, 


foro € Coup ew), Which maps to —T,( Siz )2 + 0 in Z,[[H]]. But recalling 


fo = fo(ez',z) here, we have 


Ofo _ Ofo(xz",z) Bez" | Ofo(az',z) Oz 


Oxo Or. Ory Oz Oxo 
where its first term must vanish in Z,[[H]] because —1) is 
equal to f, — 1— Bfe(Ey 8), — 1) which vanishes in Zp[[H]]. Then since 


(22)2° = _T,, it follows that 


OQ» 


Ofe(Z5 32) /,.~1 
Seaia *) (33 
Oo 2 + 


; soos Of, (wy, . F(W, —1.W.-1)- 
Ea Ts) = —( Him, Pea et te lim eat 
Werk Wat 
In terms of the variables U; = log X; (i = 1,2), we conclude (after de 


VHospital’s lhmit rule) the following relation between the Tate-Eisenstein 
power series and Jacobi sum power series. 


(3.5) Theorem. 


a 


- 0 
ta(T i jis) = log F,(S, z) = i U2)- 
LS 


OT 


forage GQ( ipo ye 


This sort of relation between genus | and 0 was first expected by Takayuki 
Oda in his comments on a seminar talk by the author at RIMS, Kyoto 
University in 1993. The above formula was then obtained in the course of 
studies along [IN, N3] with Y.Ihara. Theorem (3.3) can then be deduced 
also by combining Theorem (3.5) with the following formula: 


(3.6) Theorem. (Anderson [A], Coleman [C], Ihara-Kaneko-Yukinari 
[IKY]) 


m>3 i+j=m J: 
\ odd 7,j21 


The explicit formula (3.6) was proved by Anderson [A], Coleman [C] and 
Thara-Kaneko-Yukinari [IKY] independently around 1985. Later Ichimur4 
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c], Miki [Mi] gave simplifications of the proof. All hee nies so far 
epended on the interpolation properties of the values F,(C! ee ere — 1) 
y Jacobi sums. 


* * * 


Recently, (in a more general profinite context) Ihara [I3], using the 5- 
cyclic relation of the Grothendieck-Teichmuiler group, gave a purely alge- 
praic proof of the factorization 


Fo(T1,T2) = To(T1)Po(T2)/Po((1 + M1) + Ta) — 1), 


where ,(T) € W,[[T]] is Anderson’s Gamma series [A] (Wp: the ring of 
Witt vectors of F,). In particular, F, has to be of the form 


cn uiv2 

Fo = exp( ) | &m oD ‘i 7) 
m>3 itj=m “J: 
odd 


with some constants c,,. Then, he derived ¢m = ¥m(o)/(I"™~! — 1) by a 
direct method observing meta-cyclic covers of P! — {0,1, co} (cf. also [De] 
§16 for the last technique). Thus, we now have a purely geometric proof of 
Theorem (3.6) without use of Jacobi sums. 

Returning to our elliptic context, we see that combination of Theorems 
(3.3), (3.5) may also reconfirm the same values of c,,,’s independently, lead- 
ing us to an elliptic interpretation of the logarithmic derivative of Anderson’s 
Gamma series (with constant term dropped): 


Dlogl',(Ts) — DiogI',(0) = E4(T), To). 


If p-adic Tate curves <Q / qr? ™ (n € N) are employed instead of a single 

Tate curve over Q[[g]], then it can be shown that those Eisenstein power 

series fe? (0, T2)}nen produce a Q,-valued distribution whose ‘asymptotic 
_ expansion’ gives a power series 


(P) ( UF 
2 pa emt q \Pma1(o eect 
m>2 
even 
for o in the ramification subgroup R of GQ,(upo)s Where Ym : R — Zp 


tepresents the m-th Coates-Wiles homomorphism, and E?(q) is the p- 
adic Eisenstein series s6p(1 —s)+ + oP) | n)q” of weight s introduced by 
J.P.Serre [Se]. For this and other Bmiehinetic aspects, we will have more 


discussions in subsequent works. 
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Braid-abelian tuples in Spr(Iv) 


John Thompson * and Helmut Vélklein ?. University of F lorida 


March 1997 


Abstract: We study a class of generating systems of PSpn(q) of length 
m+ 2, where n = 2m. These systems are not quite rigid, but very close: 
The pure braid group induces an abelian group of permutations on inner 
classes of these tuples (actually, an elementary abelian group of order 2): 
The corresponding Hurwitz spaces are well under control. We show they are 
unirational varieties over Q in many cases. This yields Galois realizations 
over the rationals for PSpn(q) under various conditions on n and q. 


QO Introduction 


Let gi,...,gr be generators of a finite group G with g,---g, = 1. We say 
they form a rigid generating system if for any generators g/,...,g/ of G with 
91 °°°g, = 1 such that gi is conjugate g; for all i, there is unique g € G with 
9; = 9~*9:g for all i. We say the system is rational if for each integer ™ 
prime to |G| there is a permutation 7 of r letters such that g” is conjugate 
9x(i) for all 2. One version of the rigidity criterion (e.g., [V1], Cor. 3.13) 
says: If G has a rigid and rational system of r generators then G occurs as 
the Galois group of a regular extension of Q(z) with r branch points. 


The rigidity criterion with r = 3 has been used extensively to obtain Galois 
realizations for various classes of (almost) simple groups, by Belyi, Thomp- 
son, Matzat’s Heidelberg school and others. A comprehensive list of results 
can be found in the forthcoming book of Malle and Matzat [MMI]. In [V2] 
it was shown that the groups PGL»(q) and PUn(q) have rigid generating 
systems of length n+ 1 (consisting of perspectivities). These systems (called 
Thompson tuples) are the only known rigid generating systems of a Lie type 
group (or any almost simple group) of length > 3. Now the question arise 
whether there are others. 


sa 
* Partially supported by NSF grant DMS-9401399 
*Partially supported by NSF grant DMS-9623199 


THOMPSON & VOLKLEIN: Braid-abelian tuples in Sp,(K) 219 


The Thompson tuples are closely related to the Burau resp., Gassner 
representation of the Artin braid group over a finite field (see [V4]). A 
related representation of the symplectic braid group was studied in [MSV], 
and this led to Galois realizations over Q of the symplectic groups Sp,,(2°) 
for 2°+' > n. For Spn(q), q odd, this method did not immediately apply, but 
the analogy with the case of the Artin braid group suggested there might be 
generators of Sp,(q) with similar properties as the Thompson tuples. The 
present paper is devoted to the study of these tuples. 

Section 1 studies the case n = 4, which in some sense is the heart of the 
matter. The general structure theorem for our tuples is Theorem 2.2. It 
contains the connection with Thompson tuples: If (01, ..., om, 71, 72) is one of 
our tuples, then (excluding a certain degenerate case) the elements co), ..., Om 
leave two complementary totally isotropic subspaces W, W’ invariant, and 
together with (7, ---¢m)~* induce a Thompson tuple in W and W’’. Once 
this is proved, the existence and uniqueness theorem for Thompson tuples 
(see [V2]) can be used to complete the classification of our tuples. 

The corresponding Hurwitz spaces are described in Theorem 5.1. Over 
C, their structure is quite clear: They are obtained by adjoining the square 
roots of certain cross-ratios of branch points. The Q-structure on these 
spaces is more difficult to pin down. Under certain conditions we can show 
that they are Q-unirational. In one case, this requires looking at the bound- 
ary, which contains Q-rational points coming from the Thompson tuples 
obtained by coalescing the last two branch points. 

From this we obtain regular Galois realizations over Q for the groups 
PSpn(q) under various conditions on n and q. This extends the results of 


[ThV]. 


NOTATION. Let K be a field, and n > 1. From section 2 on we assume 
char( A’) # 2. We let K* = K \ {0}, and V is the space of column vectors of 
length n over K. Here K is an algebraic closure of K. For u,v,... € V we 
let [u,v,...] denote the subspace of V spanned by these elements. We view 
GL,,(A) as matrix group acting on V by left multiplication. Let J denote 
the identity matrix in GL,(K). 

For ¢ € GLy(K) let yz be its characteristic polynomial multiplied by 
(-1)"; ie., xo(z) = [],(z — e;), where €1,...,¢n are the eigenvalues of 
7, counted with multiplicity. We say o is a perspectivity (resp., bi- 
perspectivity) if it has an eigenspace of dimension n — 1 (resp., of dimen- 
Sion > n —2). Then the product of two perspectivities is a bi-perspectivity. 
A tuple (71,...,0,) of elements of GL,(K) is called irreducible if the ele- 
ments generate an irreducible subgroup of GLy(K). 
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1 ‘Triples of bi-perspectivities in GL4(‘) 


In this section we take n = 4. Let u4,...,ug be the standard basis Vectors 
in V; ie., wy = (1,0,0,0)! etc.. 


Lemma 1.1 Let a,c € K*. Let a1, a2 be matrices in GL4(KX) of the form 


ay 0 Ly ro 1 0 0 0 

_ {0 b 23 2% = Mops Ale aS 36 
Te OF 0. EOE a a ae, 0 
0 0 0 1 \y3 ya 0 bs 


satisfying rank(aja2 — cI) < 2 and (a,a2 —al)(u) = 0 for some uF 0 in 
V. Assume a; # 6; fori = 1,2. Let H =< aj,a0 >. 

(i) Ifu=uy+...tuq anda fc then the r;,yy, are uniquely determined 
by a,c,a;,6;. If further c = —1 and a;b; = 1 fori = 1,2 then H leaves q 
non-zero symmetric bilinear form invariant; this form is non-degenerate if 


a #1 and char(K) #2. 

(it) Ifu= uy +uy+ug and az#ecF bo thena=c and (a,a2—cl)* $0. 
(iit) We cannot have u =u, + uo. Ifu = uo + ug then H fires the span of 
ug and u3. 

(iv) Ifa =1 then H fizes the vector u. 

(uv) Ifa; Ac #b; then H fires none of the 1-spaces [u,]. 


Proof : Write u = e;u; +... + e4uq with e; € A. Comparing the third 
and fourth coordinates on both sides of the equation aja, u=a u yields 


(1) €1yi te2ye2 = €3(a—a2), e1yst+erya = e4(a — bo) 
Using this and the condition on the first two coordinates yields 
(2) €32, + e4%o = €,(a — aj)/a, €323+e@4r@q4 = eo(a — b,)/a 


If u = u, + us then (2) yields a, = a = bj, contradicting the hypothesis. If 
u = ug + ug then (1) gives ys = 0 and (2) gives z; = 0. This proves (iii). If 
a = 1 then ag u = u by (1) and the shape of the first two rows of a; also 
a,a2 u=au=u. This proves (iv). 

Now we use the condition rank(a,;a2—cl) < 2. In the matrix a,a2 —- cl, 
subtract xz, times the third row plus z2 times the fourth row from the first 
row; and subtract 2z3 times the third row plus xq times the fourth row from 
the second row. This yields the matrix 


a,—c 0 czy cL 
0 b,—c cx3 cr4 
(3) oe 
Y1 Y2 adz2—c 


\ ¥3 Y4 0 bo ey 


’ 


EE 
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This matrix has rank < 2, hence all its 3 x 3-minors are zero. If a, 
fixes the 1-space [u3] then x1 = x3 = 0; hence vanishing of the left upper 
3 x 3-minor yields (a; — c)(6; —c)(az —c) = 0. This, coupled with suitable 
re-labeling of uj ,...,u4, proves (v). 

Proof of (ii): Assume u = u; + u2 + ug and ag $c F bg. Adding the first 
column to the second in the matrix (3) and using (1), (2) yields the matrix 


/a,—c a,-—c (a-—ay)c/a cz 
0 bh-—c (a—b;)c/a  cxr4 
Yr a—ay a2—Cc 0 
\ Y¥3 0 0 bo —c¢ 
Vanishing of the right lower 3 x 3-minor yields 


(by —c) (a—c) (b1ag-—ac) = 0 


Vanishing of the minor obtained by deleting the first column and second 
row yields 
(bo —c) (a—c) (ajaq—ac) = 0 


The last two equations imply a = c (because a; # b;). The last two columns 
of the matrix a,;a@. —clI are 


aor} / bozo 
aor bor 

v= 3 and wt ae 
aga—-c¢ 0 

0 bo —6 


They are linearly independent, hence span Im(a,a@2—cl). If (a,a.—cl)* = 0 
it follows that ker(a,a2—cl) = Im(a,a2—cl) = [v, w]. This space contains 
u = (1,1,1,0)', hence uw is a multiple of v. Hence 2; = z3. On the other 
hand, (2) gives z; = (a — a,)/a and 23 = (a — b;)/a. Contradiction, since 
ay ¢ by. 

Proof of (i): Assume u = u; +... +uq. In the matrix (3), add the first 
column to the second and the fourth to the third, and multiply the third 
column by a. This together with (1), (2) gives the matrix 


{%—-c a,—c (a-—a)e cz 
0 bi-ec (a-—b)e cx, 
: Y1 a@—d2 (a,-—c)a 0 
\ ys a—b. (bg-c)a be-c 


Again we use that its 3 x 3-minors are zero. The upper left 3 x 3-minor 
yields y;. Other minors yield yg, rz and x4. The remaining z;, yz are then 
iven by (1) and (2). The result is: 


(4) e..= (a2 —¢) ea) 2 (c — bz) (ac — a, a2) 


ac (a — b2) ac (aq — bx) 
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_ (a2 —¢) (ac — bib) (c — ba) (ac — bya) 
23 =, or = 
ac (a2 — bo) ac (a2 — ba) 


pes (a, — c) (ac — by a2) es (c — 61) (ac — aja) 
; c (a, — by) - ce (a, — by) 
(a, —c) (ac — b,b2) (c — by) (ac — a,b) 
y=, OF? = eS 
c (a; — by) ce (a, — b)) 
This proves the first assertion in (i). Now assume c = —1, ajb; = 1. Using 


(4) we check that the symmetric matrix 


( 0 (a, —1) (a2—1)a —a,a2,a—1 a,a+ a> \ 
S$ = , (a, - 1) (a2-l)a 0 a; + a2a —-@,;d2-a ‘ 
\ —a,;a,a—1 a, + aza 0 (a; ~ 1) (a2 ~1)) 

a,a+ aa —@,@.—a (a, — 1) (a2 -1) 9) 


satisfies a! S a; = S for i=1,2. Thus aj, a leave the symmetric bilinear 
form associated with S invariant. This form is non-zero since a; # 1 # ay. 
Since 

det(S) = ~—4 ayaa (a; —1)* (ag — 1)? (a—1)? 
we see the form is non-degenerate iff a # 1 and char(K’) # 2. 


Remark 1.2 The proof actually shows that given a,c,a;,6; € K* withaF 
c, a; # b;, there are unique matrices a1,a2 of the shape given in Lemma 
1.1 with rank(ajaq — cI) < 2 and (ayaq — al)(uy +... + ug) = 0. Tt can 
be shown that they generate an irreducible group if and only ifa #1 #¢, 
a1 a9b1b5 fs ac’, a; - c - b; and ac a {a1@2,61b2, a,b, bja2}. This is nol 
needed here. It can be used to produce rigid triples of bi-perspectivities im 
certain groups PGL4(K) and PUs(K). 


2 The tuples in Sp,(K) 


From now on we assume char(K) # 2. Let n = 2m > 4 even and G = 
Spn(A), the subgroup of GL,(K) fixing a non-degenerate symplectic form 
(,) on V defined over A’. For elements or subspaces U,W of V we write 
U 1. W iff (U,W) = 0; and Ut = {uv € V: v LU}. Let ¥ be the 
set of ¢ € G with rank(o — J) = 2 having two distinct eigenvalues # | 
Then these two eigenvalues are of the form a,a~! with a # +1, and ys = 
(2? -tz+1)(2—1)"~-? witht = at+a7! # +2. We call the eigenspaces C = 
ker(a — aI) and C’ =ker(o — a~+J) the centers of ¢. These are 1-space 
in V with C £ C’. The map o + t induces a bijection between the set of 
conjugacy classes of G contained in © and the t € K with t # +2. 
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emma 2.1 Leto,,...,.¢m41 € G with o; EX fori<m, rank(¢my1—I1) < 
9 and 01 °°'Om4, = —I. Ifn =4 assume additionally that o3 is not similar 
qiag(1, 1,-—1,-1). Let l1<j<k<m. Then we can label the centers of 0; 
(resp-» o;) as Cy, C} (resp., Cy, Ci) such that C; ee Oy Cy Je Ci, Cj Ba Cc. 
and C, £ Ce. Further, the group < oj,0% > fires Cj + Cy and Ci + Ci, 
hut none of C5, C3, Ce, CL. 


Proof : By braiding the tuple (cj, ...,¢m) (see section 3), we can move 

VAR into the first two positions. Thus it suffices to do the case 7 = 1, 
fk = 2. 

Let X; = (0; —I)(V). Then dim X; = 2 for i < m, and dim X41 < 2. 

We first prove that: 


(5) Ve= NiO os O Met Oana 


Let EF = X,+...+Xm-1+Xm41. Then E is invariant under all o;, and 
gm acts in V/E as —1. Since om € © this implies EF = V, hence (5). It 
follows that the space U = X; + X» has dimension 4 if n > 4. Also, U is 
invariant under o, and ap. 

Let o9 = 0102 = —(03---Om4i)~*. Then oo acts trivially in V/U, hence 
ker(o9 + J) C U. We have ker(o3 — 1)... ker(om41 — J) C ker(oo + J), 
hence 


(6) dim UNker(og + J) > 2 and (o—-JI)\(V) CU 


Recall that dim U = 4 ifn > 4. If-n = 4 then by (6) and the extra 
hypothesis in this case we see that dim U > 3. 


Case 1: UNU+t £0. 

Let R= UNMUt+. The elements oj, 02 act trivially in V/U, hence in 
V/(U + U+) = V/(R+). It follows that o,,¢ act trivially in R. Thus 
R#U, and so dim R= dim U — 2. 

Since go acts trivially in R, it follows from (6) that oo acts as —1 in 
U/R. Thus o; and —o3' induce the same transformation in U/R. Hence 
the centers of 7, coincide with those of a2 modulo R. Thus we can label 
the centers of o, (resp., 72) as Cy, Ci (resp., Co, Ch) such that C, = 
Cz and Ci = Cy mod R. Then they satisfy the orthogonality relations 
in the Lemma. If C; = Co then dim U < 4, hence n = 4; then on, 
2 act trivially in the 2-space C//C,, hence so does oo and thus ag is 
Similar to diag(1,1,-—1,—1). This is excluded by hypothesis, hence C, # C2. 
Analogously, Ci # C4. 

Thus C, +Cy = C; +Y = C2+Y for some l-space Y < R. Hence oj and 
2 fix C, + C2 (and analogously, C} + C4). Finally, o, acts on Cy + Y as 
diag(s, 1) with s # 1, hence fixes no l-space on C} + Cp = C1 + Y except Ci 
and Y. Thus o;(C2) # Co. This (and its analogues) proves the Lemma in 
Case 1. Thus we may assume from now on that U is non-degenerate. Then 
in particular, dim U = 4. 
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Connection to section 1: 

Set a; = oj|y for? = 1,2. Let u be an eigenvector of aja2 = ool; fod 
some eigenvalue a. Since U = X; + Xo, and X; is spanned by the centers 
of o; (for i = 1,2), there is a basis uj,...,uq of U such that the matrices of 
@1,@ 9 with respect to this basis are as in Lemma 1.1, with c = —1, a;b, ~ 1 
and aj,b; # +1 for i = 1,2. In particular, the centers of o; (resp., Oo. 
are [u;] and [ua] (resp., [ug] and [u4]). None of these 1-spaces is fixed by 
< 01,02 >, by part (v) (of Lemma 1.1). 

If a = 1 then by (iv) we get ker(a, — J)M ker(a2 — J) #0, hence Im(a, ~ 
I)+ Im(a2 — I) # U. Contradiction, since Xj + Xp = U. Thus a ¥ 1. 
Case 2: a#+1. 

Then the eigenvalues of aja2 are —1,—1,a,a~' witha #a~!. Thus the 
a-eigenspace of @,q@ is 1-dimensional, hence equals < u >. 

Replacing each u, by a scalar multiple we may assume u = e)u, +... 4 
equg, ey € {0,1}. Assume first that all e, = 1. Then we are in case (i) of 
Lemma 1.1. Thus there is a non-zero symmetric form (,)’ on U invariant 
under the a;. We can write it as (v,w)’ = (A(v),w), v,w € U, where LE 
End(U) commutes with the a;. Then fixes all eigenspaces of a; and a;a9; 
in particular, it fixes all < uj; > and <u>, where u =u, +...+ uy. Hence 
A is a scalar. Thus (,)’ is symmetric as well as symplectic, hence is zero — 
a contradiction. 

Thus not all e, = 1. By (ii),(iii),(v) and their analogues we see that 
e, # €2 and e3 # e4. Thus we may assume u = uz + ug (by suitable re- 
labeling of indices). Hence by (iii) the a; fix the 2-space W = C; + C2, 
where C) =< uy >, Co =< uz >. Then ae acts in W with eigenvalues 1 
and ay # 1, which forces W to be degenerate; i.e., Cy L Co. 

Now let u’ be an a!-eigenvector of a,a2. It follows analogously that 
u’ € W' =C}+C%, where C! is a 1-dimensional eigenspace of a;, the space 
W’ is invariant under the a; and C 1 C4. The space [u, u’] = Im(a,a2+/) 
is non-degenerate, hence cannot equal W. Thus W # W’. Thus if WOW’ 
is non-zero then it equals either Cy or C2, hence C, or C> is fixed by both 
a;. This contradicts (v), hence WN W' = 0. 

If C; 1 C; fori # 3 then CY = C+ MW’ is fixed by a; and ag. Again a 


contradiction to (v). This proves the claim. 


Case 3: a ,a9 has —1 as only eigenvalue. 

Analogously to (5) we get U = Im(a,—1)@ Im(a@1a29+/). Thus Im(a1 027 
1) is 2-dimensional. It cannot be non-degenerate, since @1@2+/ is nilpotent 
in Case 3. Thus Im(a;a.+/) equals its perpendicular space ker(aya2 + I). 
In particular, (@;a2 +J/)? =0. 

By the previous paragraph, we can write ug = u”+u' with u” € Im(a;—J) 
and u’ € Im(ajaq + I) = ker(ayay + I). Since Im(a, — J) = [ui, ua] We 
have u’ = ej u, + e2uo + ug for certain e, € WN. In particular, u’ 4 0. Thus 
we can take u = u’ in Lemma 1.1. Replacing u, and ua by suitable scalar 
multiples we may assume e, € {0, 1}. 
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Since (@1a@2 — cI)? = (ajao2+ I)? = 0 we have u # wu, + uo + ug by (11). 
Thus u = u, +ug or u = uot+ug. Hence the a; fix [u1, ug} or [ue, ug] by (iii) 
(and its analogues). Replacing u3 by ug yields that the a; also fix [uy, u4| 
or [u2, 4]. The rest is as in Case 2. a 


Let 7 be the set of transvections in G = Sp,(K) (elements 7 with 
rank(r — J) = 1). The sets 7 and © are invariant under the normalizer 
G = GSpn(4) of Gin GL, (4), and G acts transitively on 7. If A’ is finite 
then 7 consists of two G-orbits. 

Let NV be the set of tuples (01,...,¢m,71,72) with oj € L, tr; € T and 
1+": OmT172 = —I. Let G act component-wise on NV, and let N be the set 


of G-orbits on NV. 


Theorem 2.2 Let n = (¢j,...,0m,71,72) EN. 


(i) The centers of all the o; are mutually distinct, hence form a set of 
cardinality n = 2m. Orthogonality 1 1s an equivalence relation on this set, 
and there are eractly two equivalence classes, of the form {C\,...,Cm} and 
{Cy,...,Ci,}, where C;, Cj are the centers of o;. Let W (resp., W') be the 
sum of the C; (resp., C}). Then W and W' are maximal totally isotropic 
subspaces of V, invariant under allo,;, and dim (WOW’) <1. The group 
< 01,...,%m,71,72 > acts trreducibly n V. 


(iz) Let a; be the eigenvalue of o; on C;. If ay---am # (—1)™7?} then 
V=WOW’. In this case, the restrictions of o1,...,¢m and of (o,---om)~' 
to W form a Thompson tuple (in the sense of [V2]); in particular, the group 
<01,...;0m > acts irreducibly in W (and W’). 


(111) Consider another tuple n = (04,...,6m,71,T2) € N, and define the a; 
asin (ii). (They depend on the choice of one of the two equivalence classes 
associated with n). Then n and n are conjugate under G if and only if 
Bither a; = a; for alli or a; = ag for alli. 

(iv) If in the tuple (o4,...,0m,71,72) we replace the m-th entry by gm = 77} 
and the (m+ 1)-th entry by 7, = T/™"! then we obtain another tuple n in 


N.. Associated a; are given by a; = a; fori=1,..,.m—1 anda, =a7!. 


fe) Let Lb = Klepwacds jo Then |b KK)... fet SS af Tee hs, 
otherwise S={sE€L: Nryx(s) = 1}. Then alla; he in S. Now suppose 
K as finite. Then 7, and tT. are G-conjugate if and only if the element 
(—1)™-1!a,---am is a square in S. 


Proof : Set om41 = 7172. Then om41 is not similar diag(—1,—-1.1...., 1) 
(because the latter matrix cannot be the product of two transvections). 
Hence ¢},...,7m4 are as in Lemma 2.1. This implies the first assertion In 
(i). 

For the second assertion, let 7, j,k be distinct indices from {1,...,m}. Let 
C;, C;, Cy be centers of o;, 7}, ox, respectively, satisfying C; 1 Cj and Cj 1 
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Cy. Let Ci, Ci, Cy be the other center of oj, oj, O;, respectively. Assume 
C; £ Cy. Then C; L Ch (by the Lemma). We consider the case i < j<k 
the other cases can be done by obvious modifications. By braiding action, 
the elements o;, oj’, and a, occur in a tuple with the same properties Me 
(01, ---;m41). The centers of o/? are C = (Ci) and C’ = oy (Cl), We 
know that C+ C; = C; + C;, and either C 1 Cy or CL 1 C UL Ch 
then (C+ Cj) £ Cy, hence C; L Cy, contradicting the assumption. [f 
C1 C, then (C+ Cj) 1 Cy, hence C; 1 Cj, also a contradiction (singe 
C; 1 C;,). This proves that orthogonality is an equivalence relation on the 
set of centers. By Lemma 2.1, there are exactly two equivalence classes 
each of which contains one center of each oj, i < m. Hence we can label 
these classes as claimed in (i). 

The spaces W, W’ are clearly totally isotropic, and by Lemma 2.1] they 
are invariant under all oj. Let B, be the 1-space (7; — I)(V). Then W + 
W' + By is invariant under all o; and 7,, and these elements act trivially 
in V/(W + W' + By). Hence r2 acts as —1 in V/(W + W’ + By), which 
implies W + W’+ B, = V. Hence dim (W + W’) > n—1. Since W, W’ 
are totally isotropic this implies dim W > m—1. If dim W = m—1 then 
O1,...,0m act trivially in the 2-space W/W, hence Om+1 acts as —1. Thus 
Tm+1 is similar diag(—1,—1,1,...,1), a contradiction. Hence dim W = dim 
W’ =m. This proves all assertions in (i) except the last one, which will be 
proved after the proof of (iv). 


(i1) Set Cr4i1 = (om41 — I)(W). We claim that dim Cmii = 1 (= dim 
Ci, 7 < m), and o; acts trivially in W/C; for i = 1,....m+1. The space 
WW’ is orthogonal to all C;, C/, hence it contains none of the C;, C’ 
for i < m. This implies that o; acts trivially in W/C; for i = 1,...,m. If 
V=W6W’' the group < @1,...,0m41 > acts dually in W and JV’; since 
rank(om41— I) = 2 by (5) it follows that dim Cm41 = 1 and om4, acts 
trivially in W/Cy41. 

Now assume dim (WM W’) = 1. The o;, 7 < m, act trivially in V/(W + 
W’), hence in R := WNW’ (because of the dual pairing between V/(W+W’') 
and WNW’ =(W+W’)+). Thus om4, acts as —1 in V/(W + W’) and in 
R. Since rank(om41 — I) < 2, it follows that om41 acts trivially in W/R; 
and R= C41. This proves the claim in the previous paragraph. 

We further claim that C\,...Cm41 form a frame in W; i.e., any m of 
them are linearly independent. Since W = C, @...® Cm we only need to 
show that C4; does not lie in the span of m — 1 of the others. If, say, 
Cm41 CU = Ci +...4+Cm_, then U is invariant under all o;, and om acts 
as —lin W/U. Contradiction, since oj, € E. 

In the terminology of [V2], it follows that the restrictions of 
O1,-.;0m,—Tm41 to W form a 
@mere rene lars penne: ee (-1)™-1a7? ---a>!)-tuple in GL(W), adapted to the 
frame C1,...Cm41. If a, ---am # (—1)™~! then this tuple is irreducible by 
[V2], Lemma 2, hence is a Thompson tuple; further, WM W’ = 0 since this 
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pace is invariant under the oj. This proves (11). 


(iv) It is clear that nm € MN. The centers of G6, = of! are Te) and 
(Cm): Since n € N we know by (i) that exactly one of te (Cu) nn 
O,t--+Cm-i or 7, (Ci) L C1 +...+Cm_ 1 holds. We need to rule out 
the former case. First we prove: 


(7) om+41(Ci) # C; for be DY nh 


By braiding it suffices to prove this fori = m. Assume om4i1(Cm) = Cm. 
Then the element o1---¢m-1 = (—O@m@m+1)7! fixes both summands in 
the direct sum decomposition W = C,, @(C, +...+ Cm_1). Hence this 
element acts trivially in C,, (since it does so in W/(C, +...4+. Cm-_1) ). Also 
Om+1 acts trivially in Ci, since Cy, # Cm4i. Thus om acts as —1 in Cy. 
Contradiction, since o,, € L. This proves (7). 

If (m1 —1)(V) C W then W C (rm, —1)(V)* = ker(r, — J), hence 7;|w = 
id and so or lW = 77 '|w; then (r7*~— 1)(V) C W, hence also 73 *|w = 
id and so One |W = id, contradicting (7). This and its analogue proves 


(8) (#—1)(V) CW for 7 =1,2 


it 7,(C;) = C; for some i < m then 7, °'(C;) = eran (Gro Cc W, hence 
im (C;) = ©, by (8) and so om41(C;) = ™79(C;) = C;, contradicting (7). 
This proves 


(9) Ti (Cokes C; for B= 1,...,m 


Now let v #0 in Cy. Then u := (tr, ' — I)(v) # 0 by (9). If (Cm) 1 
Cyt...4¢Cm_; then u L Cyt+...+Cm_1, hence Cy +...4+Cmi1 C ut = (1 - 
I)\(V)+ = ker(r, — I), contradicting (9). Thus r7*(Cm) £ Ci +... + Cm-1. 
This concludes the proof of (iv). 

Now we prove the remaining part of (i): irreducibility of the group < 
71,...,0m,71,72 >. By (iv) we may assume a,-:-ay, # (-1)™—!. Then 
< 0],...;0m > acts irreducibly and inequivalently in W and W’. Hence it 
suffices to show that 7, fixes neither W nor W’. This follows from (8) and 
(9) (and analogues). 


(iii) Set b = (~1)™ay*---az. By the proof of (ii), om41 acts on W 
as diag(6,1,...,1) if 6 # 1, otherwise as transvection. Further, C1; and 

m+1 = (Om4+1 —1)(W’) are 1-spaces. Thus Z := Cm4it+Chy, = (@m4i1- 
I)(V) is a 2-space. The space Y = (7, —1)(V) + (12 — I)(V) has dimension 
< 2 and the element o,,41; = 7,79 acts trivially in V/Y. Hence Y = Z. 

Now consider some n as in (ili). The ”only if’-part of the claim is clear. 
To prove the ”if’-part, we may assume K = A. Indeed, if m and n are 
conjugate under GSp,(A) then they are also conjugate under GSpn(A‘) by 
the argument in [V2] after Theorem 3. 
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By (iv) we may further assume a;---am # (—1)™7! (ie, b # ~1) 
Then V = W @W’ by (ii), and similarly for n. By transitivity of G én 
pairs of complementary maximal isotropic subspaces of V we may assume 
W (resp., W’) is spanned by the a;-eigenspaces (resp., a; *-eigenspaces) of 
the o;, 7 = 1,...,m. The uniqueness theorem for Thompson tuples (see 
[V2]) implies now the existence of g € GL(W) with ¢;|w = (oilw)9 for 
t= 1,...,m. This g extends to a unique element of G fixing W and W’. Vi, 
conjugating with this element we may assume G; = o; for all 7. 

Assume first b #1. Then o,,4; acts on Z as diag(b,b~') and acts trivially 
in oe hence Z is non-degenerate. Thus 7 and 7 act non-trivially in Z: 
hence induce transvections in Z; same for 7, and 72. A simple calculation 
in 2 x 2-matrices (see [ThV]) now shows that 7|z and 7\|z are conjugate 
under an element of GL(Z) fixing Cym41 and C),4;. This element extends 
(uniquely) to some fh € G acting as a scalar in W and in W’. This 
conjugates 7, to 7, and centralizes the o;, hence conjugates n to n. 

Now assume 6 = 1. Then om41 acts trivially in Z, hence Z is totally 
isotropic and thus also 7), T2 act trivially in Z (since (7; — I)(V) C 2). 
Choose es € W\ (Ci,4,)+. Then e1 := om4i(e3) —e3 € Cm4i \ {0} 
(since e3 ¢ Z+ = ker(om41 — I) ). Choose eg € W’ with (e;,e4) = 1 and 
(e3,e4) = 0. Then e2 := om4i(ea) —e4 € Chy4, \ {0}. Further, (e1,e2) =0 
and (€2,€3) = (@m41(e4),€3) = (€4, m4 (€3)) = (€4,€3 — e1) = —(€4,€1) = 
1. Thus the 4-space X = [e1, €2, €3,€4] is non-degenerate, and om41, 71, 72 
fix X and act trivially in X+. 

Any p € GL(X) with (p —J)(V) C ZC ker(p — J) is given in the basis 


€1,€9,e€3,¢€4 by a matrix of the form 


[Ip A 

(0 In 
where Ip is the 2 x 2 identity matrix and A € M2(KX). Such p lies in Sp(X) 
iff the diagonal entries of A are equal; and p is a transvection iff det(A) = 0. 


. 


ac 
Further, ¢m4 corresponds to A = [9. Let 7, correspond to € 1; Then 


9 


aoc ae 
a- = cd, and tT? corresponds to Ip — (3 -). The condition that the 


latter matrix has zero determinant yields a = 1/2. Thus d # 0. Since 
(7, — I)(V) = [(1/2)e, + deo], it follows that we can conjugate 7, by an 
element h € G as above to assume (7, — I)(V) = (7, — I)(V). This forces 
the 2 x 2 matrices corresponding to 7, and 7, to be equal. Hence 7, = 71: 
This concludes the proof of (111). 


(v) Again we may assume a,---am # (-1)"™~!, hence V => W@ w'. 
Each a; satisfies an equation z2 —t;r +1 =0 with t; € K, hence if a; ¢ 5 
then [K(a;) : K] = 2 and Nxca,ysx(ai) = 1. Thus L = K(aj,...,am) 18 4 
finite Galois extension of K, and Gal(Z/A’) permutes C; and C} for each 
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j= 1,..,m. Also Gal(L/A) permutes W and W’, since these are the only 
non-zero, proper subspaces of V invariant under all o, (and W. JV" are 
defined over L). The stabilizer of W in Gal(L/A) fixes each a;. hence is 
trivial. Thus [L : A] < 2, and if [Z: A] = 2 then the non-trivial element of 
Gal(L/A) maps each a, to ae. Hence all a; € S. 

Now assume J is finite. We use the notation from the proof of (i). Note 
that Z is defined over A, 1.e., spanned by Z—x = ZK”. Further, 6 # —1 
by assumption. Assume first } # 1. Then t,|z and r2|z are non-commuting 
transvections, hence by [V1], Lemma 3.27 there is a basis of Zx in which 
these transvections take matrix form 


1 W% 1 0 
(oe € 1) and J> = (; : 


with u€ A*. Then u+2 = trace(T,72) = trace(om4i{z) = 6+ 67", hence 
u = 6+b-!—2. It is easy to see that 7; and ro are G-conjugate if and only 
if T, and T> are conjugate in SLo(A’). The latter holds if and only if —u is 
a square in A”, see [V1], Lemma 3.27. 

We have seen that 7, and To are G-conjugate if and only if 2—6— 67? is 
a square in A”. It remains to show that 2—6—b7! is a square in A™ if and 
only if —b is a square in S. If 6 € K (i.e., S = K*) this follows from the 
identity 2—b—6-! = (—6) (1—67!)?. Now assume b ¢ K, i.e., [ZL : A] = 2. 
If2—b—b-! =k? for some k € K* then 


b-1 2-b-b-! 
NijK(——) wie = 


and a 
(6 —1)- b-1., 


2—b-—b-! ~ 


Conversely, if —b = s* for some s € S then 


—h = 


2—b=07) = (s+s 1) € K?. 


This concludes the case 6 4 1. The case b = 1 can be reduced to the 
previous case using (iv) and braiding unless a? = —1 for all 7. In that case 
the claim can be shown using the set-up of (111). We omit the details. 


3 Braiding action on the tuples 


The Artin braid group 8, on r strings has a presentation with generators 
Q1,...,Qr-1 and relations 


Q:Qi41Q:i = Q:41Q:Qi¢1 for? =1,...,r-2, Q:Q; = QQ: for [e—3| > 1. 
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Mapping Q; to the transposition (2,7+1) yields a homomorphism « “be 
Sy. The kernel of « is called the pure braid group, and denoted by B() ie 
is generated by the 


O57 Ware! Qian Oe, Ora Oe f Pathe Sx. 


The braid group B, acts from the right on tuples (g,...,9,) in G" by the 
rule that Q; maps (g1,..., gr) to 


-l 
(915+ +65 Gi—1y Gide Gini 9iGi41 a+-03 Gr): 


Now let r = m+ 2 in the set-up of the previous section. We get an induced 
action of < BY), Qy,..., Qm— 1,;Qm41 > on N and N. From Theorem 29 
(ii), (iu) we see that Qm4i and the Q,; with 1 <i<j <m act trivially 


on N. Since Qim+2 = Qoat for 1 <2 < m it follows that Qim4; and 


Qim+2 induce the same permutation on N. Further, Q; (i < m) acts as 
transposition (7,7+1) on the tuple (aj, ...,a@m) associated with an element of 
N (see Theorem 2.2 (ii)). Finally, Qi m41 (i < m) maps n € W associated 
with (aj,...,dm) ton € WN associated with 


(10) (aisar) Q;_- 1,4; ey re eee Gin} 


For 2 = m this is (iv) of Theorem 2.2. If true for some 1 <i < m it follows 
for 7='1 because: Qj .meic= Ort. This proves (10). Combined with 


Theorem 2.2(i11) this implies: 
Corollary 3.1 


(1) Two elements of N, represented by the tuples (O1,-.)%m,71,72) and 
(O1,..;Gm;71,72), le in the same B)-orbit tf and only if trace(o;) = 
trace(o;) for alli. 


(ii) All elements of N have the same stabilizer in B). This group K 1s 
generated by the squares in B") together with the elements 
Qij, l<i<jegm 


Qim4+1 Qn l<gi<cm 
Oimaie Oram. dnd Oneimie- 


Thus B‘")/K 1s an elementary abelian group of order 2™-! acting freely ov 
N. In particular, all B)-orbits on N have length 2™-!. 
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4 Existence of the tuples, and generated sub- 


group 
Let t1,..tm € WK \ {42} and t = (t1,...,tm). Let M(t) be the set of all 
G1, -+-1%m,71,72) € N with trace(o;) = t; +n—- 2. Then og; is similar to 
Biag(a;,a;°, 1,...,1), where a; + az’ = t,;. Let V(t) be the set of G-orbits 
on N(t). 

First we show that if a; € A for all 7 then V(t) # 0. We may assume 
a,:°°dm #(—-1)™~*. Let Wo be a K-vector space of dimension m. By the 
existence theorem for Thompson tuples (see [V2]) there is a Thompson tuple 
+1 oo, —Oma1) in GL(Wo) such that of (i < m) has diag(a;,1,..., 1) as 
a representing matrix, and rank(om4, — J) = 1. Let Wg be the dual of 
Wo, and <,>: Wo x Wi — K the dual pairing. Extend o® to an element 
a; of GL(Wo © W5) by its natural action on Wj. Then the o; preserve 
the non-degenerate symplectic form f((w,w’),(u,u’)) =< w,u’ > - < 
u,w’ >. As in the proof of Theorem 2.2 (iti) we see that om41 can be 
written as 0m4 1 = 7,72 for transvections 7), 7. that also preserve f. Then 
(o1, wey Tm, iyi) E N(t). 


Lemma 4.1 Suppose K = Fy is finite. Then N(t) # O if and only if 
a; EK for alli ora; ¢ K for alli. 


Proof : The condition is necessary by Theorem 2.2 (v). The remarks 
preceding the Lemma show that there is a tuple n = (0},...,0m, 71,72) 
in Spn(K) with the desired properties. Let a be the automorphism of 
Spn(A) raising each matrix entry to the power q. Then n° is a tuple with 
the same properties, hence by Theorem 2.2 (iii) there is h € Spa(A’) with 
n* =n". By Lang’s theorem there is g € Sp,(A’) with h = g~'g*. Then 
(n-')®° = n9". Hence n?* € Nt). E 


Lemma 4.2 Suppose n> 10 and K = Fy, where q ts a power of the prime 
p. Let (01,...,;0m;7,7T2) € N, with o; similar diag(a;,a;°,1,...,1). Let 
tj=a;+ an 

(1) The group G = Spn(q) is generated by oj,...,0m,T1,T2 if and only af 
Bo (tijanstm) = Fy. 

(2) Assume ay::' am #(—1)™7!, and set H =< 04,...,0m >. 

If Fy (ti,....tm) = Fy and the a; le in F, (resp., the aj lie not in F,) then 
H is isomorphic to a group between SLm(q) and GLm(q) (resp., between 
SUm(q?) and Um(q") ). The images of o1,...,¢m and (o,---om)~* under 
this isomorphism form a Thompson tuple in GLm(q), resp., GLm(q") whose 
i-th element (i < m) is a matrix similar to diag(a;,1,...,1), and whose last 
element o satisfies rank(o + I) = 1. 


Proof : We first prove (b). By Theorem 2.2 (ii) we know that restric- 
tion gives an injection H — GL(W), and the images of o1,....0%m and 


14 
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(71 -+-0m)~* form a Thompson tuple in GL(W) with o;|w (¢ < m) and 
(o, - --Om)lip of the claimed form. Let L = Fy(a1,...,@m) (= Fg or F,2) 
Then W is defined over L, hence the oj|yy lie actually in GL(WN Ls * 
GLm(L), and their eigenvalues generate L. Now (b) follows from the clas. 
sification theorem for Thompson tuples (see [V2]). 

Part (a) follows from (b) as in [MSV], Theorem 1. (Actually, since we 
are now in the case char(A’) # 2 the proof is much easier than in [MSV)]), 


5 The resulting Galois realizations 


From now on we let A = F,, and so G = Spp(q). Let G= G/{+I} = 
PSpa(q). Fixedi,..,tm € BRAKE} and-set t = (hy. tm)e . Choose 
Q1,...,@m € Fy2 with a; + ae = t;. Assume that either all a; € F, or 
all a; ¢ F,. In the former case, set S = F*, and in the latter case let § 
be the group of elements of F,2 of norm 1 over F,. Consider the class of 
G consisting of all matrices similar diag(a;, ae: 1,...,1), and let C; be the 
image of this class in G (for i = 1,...,m). Fix one class of transvections in 
G, and call it C,_, (where again r = m+2). Here we call an element of Ga 
transvection if it is the image of a transvection of G. Let C(t) = (C1, ...,C,), 
where C, = C,_; if (-1)™~'a,---am is a square in S, and otherwise C, 
is the other class of transvections in G. We say this tuple is rational if 
Cf ...,C* is a permutation of C,...,C, for each integer k prime to the or- 


der of G. Similarly, we call the tuple (a1,...,@m) rational if GS y .csp Hr is a 
permutation of aj,...,@m for each integer k prime to q? — 1. We order the 
tuple such that C) =... = C,,, Cp,41 =... = Crygr,, etc., and there are no 


further equalities between the C;. 

We further assume there are generators gj,...,gr of G with g; ---g, = 1 
and g; € C; fori = 1,...,r. Let N(t) be the set of G-orbits on the set of 
those tuples (g1,...,gr). Each g;, 1 < m lifts to a unique oj € L; and gr-1, 
gr lift to unique transvections 71,72 € G. Then 01 ---¢m17T2 = —JI because 
by Scott’s formula [Sc] there is no such tuple generating G with product 
= [. Since the elements of G \ G switch the two classes of transvections, it 
follows by Theorem 2.2(v) that A/(t) can naturally be identified with the 
above set N(t). This identification is compatible with the natural action 
of B'"), Hence by Corollary 3.1 the action of B”) on A(t) is transitive, 
and the kernel of this action is the group K. Since B'")/K is abelian, X 
also equals the stabilizer in B‘") of each element of A(t). Let B,(t) be 
the stabilizer in B, of N(t), and let Ky be the stabilizer in B,(t) of some 
element of V(t). Then K = K2N 8B"? and B,(t) = K2B") (since B') acts 
transitively on AV/(t) ). The map « : B, — S, induces an isomorphism 
BA/BS. KS he 

Let 21,...,27 € C be algebraically independent over Q. Let S, act 
on the field Q(x1,...,2,-) by permuting 2,...,2-. The fixed field of Ss 
is Q(s1, ..,Sp), Where $1,...,S- are the elementary symmetric functions in 
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w+y2r- Let L(t) be the fixed field of the subgroup S,, x S,, x .... Then 
p(t) = Q(u,...,u,), where ui,...,ur, are the elementary symmetric func- 
fions in Z1,...,2,, etc. 

Let Qo/C(2) be a Galois extension with branch points z;,..,z, and Ga- 
Jois group isomorphic to G, such that C; corresponds to the class of distin- 
guished inertia group generators associated with x; fori = 1,...,r. Such Q 
exists by Riemann’s existence theorem because V(t) # (see ° [V1] Thm. 
9.13). By [V1], Prop. 7.12, this Qo has a unique aint field of definition 
L, CC; ie., there is 2; C Qo, Galois over L,(x) and regular over L; such 
that estriction gives an isomorphism G(Q9/C(r)) — G(Q1/L1(z)) (and Ly 
is minimal with this property). This Ly is finite over Q(s,,...,s,). Further, 
L, is regular over Q if the tuple C(t) is rational (because B‘”) acts transi- 
tively on V(t), see [FV], Thm. 1 or [V3], Thm. 3.9). Each a € Aut(C/L;) 
extends to an automorphism of Qo fixing x and centralizing G(Qo/C(z)). 
By the branch cycle argument ([V1] Lemma 2.8), a permutes z1,...,z, such 
Bhat if a(z,;) = 2; then C; = bs 5 where k is an integer such that a acts on 
the |G|-th roots of unity as (++ CF. 

Let Lo = L,Q = minimal field of definition of Q5 containing ©. The ex- 
tensions Lo/Q(s1, say Sp) and O(z1, vy Sr) /Q(51, ..., Sp) are unramified over 
the complement of the discriminant locus in affine r-space with coordi- 
nates s;,...,s,. Let II be the maximal extension of Q(s1, ..-; Sp) unramified 
over this complement. Then G(II/Q(sj,...,s,)) can be identified with the 
profinite completion of B,. Thus under the Galois correspondence, Ly and 
Q(21,... ...,Z,) correspond to certain subgroups of finite index of B,; actu- 
ally, Q(2x1, ...,€,) corresponds to B\) and Ls corresponds to (a conjugate 
of) Kz by [V1], Cor. 10.21. Thus L := LoQ(xrj,...,2,) corresponds to 
K2N BY =K, and Lo NQ(z1, ...,f,) corresponds to K2B\") = B,(t). Thus 
LaNnQ(2y,...,2 Es) = L(t) = Q(uz, ...,U,). = 

Let P = G(L2/L,). Then restriction gives an isomorphism T — G(Q/Q) 
if the tuple C(t) is rational (because then L, is regular over Q). Fix some 
y € I, and extend it some way to an automorphism a of C. This a permutes 
@1,...,2, as described above. Thus y permutes uj,...,u, accordingly, per- 
muting the blocks {u,..., up, }, {Up, 41, ---) Ur; 4rq },etc. Hence we can extend 
11z(t) to Q(z1, ...;@,) by permuting 21,...,2, In any way compatible with 
the action on these blocks. This further extends y to L = L2Q(zj,...,2,) 
because Lz and Q(2,...,2,) are linearly disjoint over L(t). 


Theorem 5.1 Let ty,...,tm € Fy\{4+2} and sett = (t),...,tm). Let C(t) = 
(C,,...,C,) be the assoctated tuple of conjugacy classes of G = PSpn(q) 
(where r = m+2). Assume N(t) #0. Let x4,...,2, € C be algebrarcally 
independent over Q. Let Qo/C(z) be a Galois extension with branch points 
21,...,%, and Galows group isomorphic to G, such that C; corresponds to 
the class of distinguished inertia group generators associated with x; for 
t= 1,...,r. Let Ly be the minimal field of definition of Qo, let Lo = L1Q 
and L = L,Q(xq,...,2,). Then the following holds. 
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(i) We have L= Ol 41, +0) Ym—11 2m: 2m41;2m42); where 


2 Im4+i1— Ti Im+2— 21 ; 
y = Oo Oi |... m-1 
Zm+1— lm Lm+2— Im 
the cross ratio Of 23,2 nem tise = Q(21, +) 2m; 2m41;Lm4o), 
where 
2 Im41— 2 i 
z= oo, t= i,...,m 
Lm+2— Fi 


Then [L: L] =2 and L = L(zm). 

(11) Assume the tuple (Cj,..., Cy) 1s rational and also the class Cm ts rational 
in G. If either q is a square or m is odd and Co, = Coy_; for 2v <m then 
we can ertend the action of T = G(L2/L1) to an action on L such that L! 
ts a rational function field over Q. 


(211) Assume there are aj,...,@m € ee with a;ta;* =t,; such that (aj,..., Am) 
1s a rational tuple. We further assume m is even and do, = Goy_, for 
2v < m, as well as C,_; # C,. Then we can extend the action of T to L 
such that L' is a rational function field over Q. 


Proof : (i) The above identification of G(II/Q(sj,..., 8-)) with the profi- 
nite completion of B, induces an identification of G(Tl/Q(21, w+) 2p)) with 
the profinite completion of B‘"), Under this identification, Qi; becomes 
the generator of an inertia group over the place z; = zx; of Q(z1, bers 
for 1 < i<j <r (see [Mal], 3.2 or [SV, Prop. 1 and section 3.2]). Ad- 
joining /zp—22, 1 < k < <r yields an extension of Q(z,...,2,-) such 
that the place 2; = 2; ramifies in this extension iff {i,j} = {k,é}. Since 
Jz, — te € I we get 
Claim 1: The element Qj; maps ,/aj —2j to —,/x; —2z;, and fixes all 
Jr, — xe with {k, 0} # {i,j}. 

Using Claim 1 we check that the generators of K given in Corollary 3.1 
fix y1,...;Y¥m-—1- Thus the fixed field L of K contains y,...,ym-1- Now (i) 
follows because (Olan, +) Ym—1; 2m. Lm41; 2m42) Q(21,...,2r)] = 9m-l= 
(BO) : K] =[L: Q(2,...,2,)]. (The assertion about L is clear). 

(ii) Assume first that q is a square. Then the classes C41 and Cm+2 (of 
transvections) are rational in G. Since also Cm is assumed to be rational, it 
follows from the remarks before the Theorem that we can extend the action 
of [ to L fixing 2m,2m4i,2m42 (and permuting the other z;). Then r 
permutes the +y;, hence fixes the linear span of the transcendence basis of 
E given in (i). Thus L! is rational over Q by [V1], Lemma 8.7. 

Now assume g is not a square. Let p be the prime dividing g. Let C’ be 
the field of p-th roots of unity in Q, and C” the field of (q? — 1)-th roots of 
unity. Then for C = C’C” we have G(C/Q) = G(C/C’) x G(C/C"). 

Define a permutation action of G(C/Q) on the 2; as follows: G(c/e) 
interchanges ro, and x2o,_1 for 2v < m, as well as zm4, and tm42. (Since 
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G(C/C") is cyclic, there is only one such action). Further, Ges | fixes 
gm+l) Um+2, and 8 € G(C/C’) maps x; to x; as follows: We have cy = aa 
where & is an integer such that @ acts on the (q?—- -1)-th roots of 1 as ¢ + cr 


~ 


To distinguish 7 among the 7 with C; = C;, let 2 (resp., j) be the smallest 
index with Cy = C; {resp., C3 a ON. Then j-j =i ~i. Since these 
actions of G(C/C") and G(C/e") commute, they combine to give an action 
of G(C/Q). 

If k € Z is a square (resp., not a square) mod p then the two classes 
Cn+i and Cy42 of transvections are fixed (resp., interchanged) by taking 
them to the k-th power. It follows by the remarks before the Theorem that 
the action of [ extends to L such that [ permutes 7,...,2, via its quotient 
G(C/Q) as in the previous paragraph. 

Now assume m is odd. Then G(C/Q) fixes rm, hence G(C/C”) inter- 
changes +y2, and +y5,\_,. Since G(C/C’) fixes the sets {21,23, ..., &m—2} 
and {r2,24,...,&m-—1}, it follows that I fixes the linear span of the transcen- 
dence basis 2m, 2m41;2m42; Yi) Us Y3, Yay ... of ZL. Thus the claim follows 
again by [V1], Lemma 8.7. 


(iii) Assume the hypothesis of (iii). Then we can refine the action of 
G(C/Q) on the 2; such that if 8 € G(C/C’) maps 2; to 2; then af = aj, 
with & as above; and further, ifz (resp., j) is the smallest index with a; = a; 
(resp., a3 =a;) then j—j =t~1. 

This are extends the action of [ to L. Below we extend it further from 
L to L. Assume this for the moment. Then we see as before that IT fixes 


the linear span of 2m41,2m42:721)29 123; z/,.... The rest is as for (ii). 


Rational points of L over the discriminant locus: 

Identify 21,...,2,- with the coordinate functions on Q", which gives the 
usual identification of the points of Q” with the maximal ideals of Q[r,,...,2-]. 
This gives an action of [ on (oy (different from the standard Gp-action). 
Define a (finite) point of L, L etc. to be a maximal ideal of the integral 
closure of Q[zi,... ang tp ES L etc. We need 
Claim 2: L has a [-fixed point p over each [-fixed point of Q” of the form 
6D).<in De) With Pray = pe and prep; for, lei < gsr 4. 

Proof : Pick a base point pg on the Riemann sphere P! different from 
the p;, 1 > 1. Consider loops in P? \ {p,...,pr-1}, based at po. Choose 
simply closed disjoint loops 7; around p;, 7 = 1,....r — 2, and 7 around 
Pr_1 = Pr, satisfying the usual condition that 7, ---7,—-27 is null-homotopic. 
Let p. # p, be close to p, and inside y (hence different from pj,..., Pr-1)- 
Choose loops 7-1 and 7, inside y around p,_; and pi, respectively, such 
that 7,17, is homotopic to ¥. 

The Galois covers y’ of P! with group G, branch points pu, ..., Pr—1)Pr 
and associated classes C),...,C, (see [V1], Ch. 4 and 5) correspond to the 
elements of M(t), where a tuple (g1,...,9,) corresponding to y’ consists 


of the natural images of 7,...,¥- in Deck(y’) = G. Fix such gi,---.9r 
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having the property that their lifts o,,...,0m,71,72 €G (see the beginnin 
of section 5) correspond via Theorem 2.2 to the tuple (a,,...,@m) used to 
define the action of G(C'/Q) on L. 

We consider the behavior of the covers y’ (corresponding to this fixed 
tuple g1,...,gr ) aS p, moves into p,: Then vy’ collapses into a cover ¢ of Pl 
with group H =< gy,...,9r--2 > and branch points pj,...,pp—1; the Images 
in Deck(y) = A of 71, ...,%r—2, 7 are 91, ---) Jr—2,9, Where g = Gr—19r- Note 
that H = H/{+1}, where H =< «j,...,¢m > is as in Lemma 4.2: in 
particular, the image of o; under the natural embedding H — GLin(q?) ig 
similar diag(a;, 1, ..., 1). 

The field extension 2/C(zr) corresponding to ¢ is obtained by specializing 
the generic extension 9. This specialization maps 2; to p;, hence yields a 
specialization of L corresponding to a point p of L over (pi,...,p-). 

If an element of I maps p to another point p’ of L over (Dine apedy 
then it transforms Q into the extension of C(z) associated with p’. But we 
know how I acts on the ramification type of an extension of C(x): via the 
branch cycle argument [V1] Lemma 2.8. The class of H associated with 
the branch point pj, i < m of Q/C(z) is represented by diag(a;,1,..., 1). 
By definition of the action of [ on 2,...,2,, hence on pj,...,p-, and the 
branch cycle argument it follows that the extension of C(z) associated with 
p’ corresponds to the same choice of (a1, ...,@m) (since the tuple (aj, ..., Am) 
is rational). But the 2”! points of L over (pj, ...,p-) correspond bijectively 
to the 2”~! distinct choices for aj,...,@m modulo inversion. It follows that 
I fixes p. Hence Claim 2. 


Extending the action of [ from L to L: 

Let p be a point of L over p, where p is as in Claim 2. Since p lies over 
the point (p1,...,pp) which is unramified in the extension L/Q(z1,..., Lr), 
the point p gives rise to a splitting of the basic exact sequence 


1 = G(L/Q(zi,..27)) — G(£/OQ(n,..,2,") — T — 1 


in the usual way: The stabilizer I of p in G(L/Q(21, ...,2,)') effects the 
splitting. The induced isomorphism I — I extends the action of [ from 
Q(21,...,2,) to L. This action agrees with the action of ! on LZ constructed 
above because the latter fixes p. 


Corollary 5.2 The group PSpn(q), q odd, occurs regularly over Q ifn 2 
4(q+1) and one of the following holds: 

(1) q ts a square. 

(2) n/2 1s divisible by 4 and q=5 or7 mod 12. 

(3) n/2 1s odd and q=3 or5 mod 8. 


Proof : Take S = F} (a cyclic group of order g—1) or S= {a € Fy? | 
Ne 2/7, (4) = 1} (a cyclic group of order g +1) such that if q is a square 
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hen S has an element of order 4, and if g is not a square and m = n/2 
js even (resp., odd) then S has an element of order 3, but none of order 4 
resp., 5 has an element of order 4, but none of order 8). Such S exists in 
each of the cases (1)-(3). 

Ifn > 4(q+1) then there are elements aj,...,a2¢ € S\{41}, where 22 =m 
or 22 = m—1, that form a rational tuple, such that the a; + a7’ generate 
the field F,. Then a, ---ax = 1. If m is odd let am be of order 4. Then 
jn any case, if g is not a square then (—1)™~1a,---a, is not a square in 
S. Let t; =a; + ag, and t = (t),...,tm). Then the associated tuple C(t) = 
(Ci, ...,C,) is rational. By Lemma 4.1 there is a tuple (O14, 50m, 71; 72) in 
N(t). Let g1,...,gr be the images of these elements in G = PSpy(q). We 
may assume g,_; € C,_,. Then g, € C, by Theorem 2.2(v). By Lemma 4.2, 
G is generated by gj,...,g-. Hence V(t) # 0, and so there is an extension 
Qo/C(z) as in Theorem 5.1. 

We can further modify aj,...,a;, such that the hypothesis of Theorem 
5.1 (ii) resp. (iii) holds. Then the field M = L! resp. M = L! is ra- 
tional over Q, say M = Q(t),...,t-). Since M contains the minimal field 
of definition Ly of Qg, there is Qyy C Qe such that restriction gives an 
isomorphism G(Q9/C(z)) — G(Qy/M(2)) = G(Qy /Qti,..., tr, xv). Since 
G = G(Qo/C(z)) it follows that G occurs regularly over Q. 
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Abstract 


Let X be a semistable curve over a complete local ring and let 
p:Y — X be a tame admissible cover of the special fiber. To lift p to 
a tame admissible cover p: ¥ — X, it suffices to lift p locally in small 
neighborhoods of the singular points. The present paper gives a proof 
of this result using formal patching. As an application in the case of 
smooth curves, a proof of Grothendieck’s Theorem on the tame fun- 
damental group of smooth projective curves in positive characteristic 
is included. 


Introduction 


Let X be a semistable curve over a complete local ring and let p: Y ~ X bea 
tame admissible cover of the special fiber. Choose a horizontal divisor D C X 
lifting the branch locus D C X of p. To deform f to a tame admissible cover 
p:Y — X, ramified along D, it suffices to lift f locally in small neighborhoods 
of the singular points. To do so, one has to choose roots of local parameters of 
the singular points of X. Hence, deformations of p exist in general only after 
a tamely ramified extension of the base ring and are in general not unique. 
In the case of smooth curves, however, deformation of tame admissible 
covers is always possible and is unique. ‘This result was first proved by 
Grothendieck and used in his theory of specialization of fundamental groups. 
[8]. In [9] deformation of mock covers is studied and applied to tame funda- 
mental groups. These results can be reformulated in terms of deformation of 
tame admissible covers which are unramified over the singular points. In [23]. 
deformation of admissible covers over complete discrete valuation rings are 
described. This is used to construct a specialization morphism of fundamen- 
tal groups of curves with semistable reduction. The fundamental group of the 
special fiber, classifying admissible covers, is described by a graph of groups. 
There are also many results on deformation of covers of curves which are not 
admissible. They are all proved using some version of either formal or rigid 
patching. Let us only mention the result of Harbater [10] that every finite 
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group is a Galois group over Q,(r) and the proof of Abhyankar’s conjecture 
in [22] and [11]. 

The deformation theorem proved in this paper is used in [13], [19] and [23] 
to compactify Hurwitz spaces. For this application it is important to have By 
precise uniqueness statement and to work over quite general complete loca] 
rings. Even though rigid patching has often shown to be the more flexible 
approach, in particular for covers with wild ramification, in the present sity. 
ation formal patching seems to be more appropriate. The potential and the 
mechanisms of formal patching are certainly well known to algebraic geome. 
ters. But there seems to be no reference for this particular result which js 
reasonably self contained and accessible for a wider audience. 

Therefore, the present paper has two goals. First, to give a rigorous proof 
of the general deformation theorem of admissible covers. Second, to make 
formal patching and its application to fundamental groups more accessible to 
non-specialists. 


The main result 


Let R be a complete noetherian local ring with separably closed residue field 
k and let X/R bea projective nodal curve. By this we mean that the special] 
fiber X := X xpk has at worst ordinary double points 71,...,2, € X as 
singularities and their complete local rings on X are of the form 


Ox 2: | Riu, v | Uy; = ALP 


where the isomorphism is induced by elements u;,v, € Ox,2, with t, := ujv; € 
R. The elements u;,v; can even be chosen from the henselian local ring 
Ox2, C Oxz,. In other words, X is locally around z; (in the étale topology) 
isomorphic to the standard nodal curve Spec R[u;, v;|u;v; = t,]. 

Let D C X bea mark on X, ie. a horizontal divisor which is étale 
over Spec R and does not meet the singular points. A tame admissible cover 
p: ¥ — (X,D) is a finite morphism between nodal curves, which is tamely 
ramified along D, étale over X*™ — D and verifies the following condition over 
the singular points. Let y, € Y be a point lying over one of the singular points 
z;. Then y, is a singular point of Y/R, i.e. Oy,y, = Ri[s,, 3;|773; = 7] with 


n 
T, := 1,8; € R. Moreover, we can choose r,,s, € Oy, such that rj’ = wu, and 


35? = v; for an integer n; prime to the characteristic of & and u,,v; € Ox. 2: 
as above. 

Suppose we are given a tame admissible cover p : Y — (X,D) of the 
special fiber X := X xpk of X. For every point y, € Y lying over 4 
singular point x; € X we can choose f,,3, € Oy, with r,s, = 0, Fo = ti 
and 3? = 0v;, where u,,v; are as before and w,, 0, denote their restrictions 
to the special fiber. A deformation of p to R is a tame admissible covet 


p:Y¥ — X with pxpgk =p. For every deformation p of f there are unique 
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lifts rj,8, € Oy, of 7,3; with od = M6, 3)? =v, and 7, :=7r;s; € R. Thus 
the deformation p determines a tuple (7,), of elements of R with tT, ee 
Let us call the tuple (7,) the deformation datum for p corresponding to the 


deformation p. Our main result can be stated as follows. 


Theorem: The assignment 


p e— (Tt) := 785); 

induces a bijection between isomorphism classes of deformations of p to R 
and the set of tuples (t,), of elements of R with "hd =t;. Moreover, the iso- 
morphism between two deformations with the same deformation datum (r;); 
is unique. 

This theorem appears as a claim in [13] (page 61 f) for simple covers and in 
[19], §3.23 in the same generality as above. In the case that R is a complete 
discrete valuation ring and D = Q@ it is proved in [23]. The surjectivity of the 
map in the theorem is proved in [12] for the case R = A[[7),,...,T]]. Other 
special cases of the theorem appear in various papers dealing with Galois 
action on fundamental groups, e.g. in (14). 


Outline 


Section 1 serves an an introduction to formal patching. We explain the general 
idea of the proof and present the necessary tools, namely étale localization. 
descent and Grothendieck’s Existence Theorem. In Section 2 we show that 
every tame admissible cover is etale locally isomorphic to a cover of a certain 
standard shape. To show this, we introduce local coordinate systems of a 
nodal curve at an ordinary double point. Section 3 contains the proof of the 
main result. In Section 4 we give a proof of Grothendieck’s theorem on the 
tame fundamental group of curves in positive characteristic. The appendix 
contains some results about étale ring extensions and henselization which are 
used in this paper. 

Throughout, we assume that the reader is familiar with the definition 
of a scheme. However, we have tried to keep the references as accessible 
as possible. Most proofs make only references to Hartshorne’s book or a 
standard textbook on commutative algebra. 


I thank David Harbater and Helmut Volklein for helpful discussions about 
the content of this paper. 


1 Formal patching 
This section is an introduction to formal patching, as it is understood in 


this paper. The reader familiar with this circle of ideas can skip it without 
problems. The general reference for this section is [8]. 
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bo : 
= 
ie 


We start in 1.1 with an outline of the proof of our main result from q 
more general point of view. More precisely, we explain those ideas of the 
proof which are independent of the special case of admissible covers. This 
outline motivates the following subsections, where the tools we need to do 
formal patching are presented. These are étale localization, étale descent and 
Grothendieck’s Existence Theorem. 


1.1 Outline of the proof 


Lit Let R be a complete local ring with residue field k. Let X be a 
scheme over R. We write X := X xprk for the special fiber. Let P be a 
property of morphisms of schemes which is local, in an appropriate sense. To 
fix ideas, we assume that a morphism with property P is finite. 


Problem 1.1.1 Suppose we are given a morphism p: Y + X with property 
P. Does there exist a morphism p : Y — X with property P such that 
=) xp k? 


The present paper deals with this problem in the case that a morphism 
with property P is an admissible cover of curves. In the rest of this section 
we show that, under certain general conditions on the scheme X and on 
the property P, Problem 1.1.1 can be solved using standard techniques of 
algebraic geometry. 


1.1.2 The main idea is to solve Problem 1.1.1 first locally and then to 
glue the local solutions together to a global solution. To do this, we choose 
an open covering (U;),er of X. Then (U; := U;M X); is an open covering 
of X and (V; := p-1(U;)); is an open covering of Y. Moreover, the maps 
pi: V; — U; induced by pf have property P (since P is a local property). For 
t,j ET we let Ui; = UNU,, U5 = OU, 00; and V,; = VN Vj. 


Condition 1.1.2 If the covering (U;); is chosen sufficiently fine, then the 
following holds. 


(i) The morphisms f, : V; — U; can be lifted to morphisms p; : V; — Ui 
with property P. 


(ii) For 2 # 7, let U be an open subset of U;;, and let U := UN X and 
V := p-*(U). Given two morphism p, : V; + U and p2: V2  U lifting 


ply : V — U and having property P, there exists a unique isomorphism 
a: V, — V2 with ppoa =p and aly = Idy. 


Provided we use the right notion of an open covering, Condition 1.1.2 is 
sufficient to solve Problem 1.1.1. Given the local lifts p; : V; — U; of (i): 
Condition (ii) makes sure there are unique isomorphisms 


aij: py '(Uis) TS peUiy) (1) 
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jth pi ° O;,; = p; and a;; = Id. Their uniqueness forces the a; to verify 
he cocycle relation 


Tl Pel CS) . Oke, a Qidlyzt(Uy, k)? (2) 


here Uj5.4 = U; NU; OU, for t,j7,k € 7. In this situation we can glue the 
chemes V; along the isomorphisms a; ; and obtain a scheme Y together with 
a morphism p: Y — X such that p; = ply,. It follows that p is a lift of p 
with property P, solving Problem 1.1.1. 

In the case that the U; are Zariski open subsets of X, this gluing process is 
given as an exercise in [14] II, Exercise 2.12. But it is in general very difficult 
to choose a sufficiently fine Zariski covering such that Condition 1.1.2 holds. 
An elegant solution for this problem is to replace the Zariski topology by the 
finer étale topology. This means that we replace open subsets U; C X by 
étale morphisms U’; — X and the intersections U; NU; by the fiber products 
U; xx U; (for the moment we will keep the old notation). In the context of the 
étale topology, the gluing process described above can be accomplished with 
the theory of descent. This theory is a generalization of both Zariski gluing 
and Galois descent. We will see in Section 1.2 that the descent theorem we 
need here reduces immediately to an algebraic lemma dealing with faithfully 
flat descent of modules. 


1.1.3 Whether we can find a covering (U;); of X verifying Condition 
1.1.2 depends of course on the property P. In Section 2.3 we will show that 
if P means being an admissible cover of curves, then we can find an etale 
covering of X of a certain standard shape. For such a covering Condition 
1.1.2 (i) is then easily verified. This is the only part of our formal patching 
process dealing with the special situation of admissible covers. In our version 
of formal patching, Condition 1.1.2 (ii) depends on the following étaleness 
condition. 


Condition 1.1.3 There is a dense open subset Up C X such that p:Y — X 
is étale over Up M X and all lifts p : Y — X of p with property P are étale 
over Uy. Moreover, for a sufficiently fine open covering (U;); of X and for 
2 #7 we may assume that U;,; C Uo. 


In the case of admissible covers of curves, one can take for Up the complement 
of the branch locus inside the smooth locus of the curve. Then Condition 1.1.3 
holds if & is algebraically closed. 

Assume that Condition 1.1.3 holds. Then the morphisms ,,; : Vi; — Ui; 
are étale and any lift of p;; to pi, : Vi; > Ui; has to be étale. The following 
lemma shows that if R is artinian, then Condition 1.1.3 implies Condition 
boa) 


Lemma 1.1.4 Let A be a ring with a nilpotent ideal I; let A:= A/I. Then 
every finite étale A-algebra B lifts uniquely to a finite étale A-algebra B. 
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If A is artinian, then m” = 0 for n >> 0. We may assume that the Open 
subset U' C Uj; of Condition 1.1.2 (ii) is affine, U = Spec A. Then J := m4 
is a nilpotent ideal of A and V = p7!(U’) = Spec B for a finite A-algebra, 
Assume moreover that Condition 1.1.3 holds. Then we may assume that B 
is etale over A, and Condition 1.1.2 (ii) follows from Lemma 1.1.4. 


1.1.4 We have seen that we can solve Problem 1.1.1 if R is artinian and 
the Conditions 1.1.2 (i) and 1.1.3 hold. We would like to extend this result 
to the case that R is a complete noetherian local ring. The problem is that 
we can not apply Lemma 1.1.4 in the same way as before. Note that even if 
A is a finitely generated R-algebra, A is in general not complete with respect 
to the ideal J := mA. It is easy to see that the analogous version of Lemma 
1.1.4 is actually false in this situation. To get around this difficulty we need 
a further condition. 


Condition 1.1.5 The ring F is noetherian and complete and X is a projec- 
tive scheme over R. 


For n > 0 let Ry := R/m"*! and X, := X xg R,. Assume that the 
Conditions 1.1.2 (i) and 1.1.3 hold if we replace the R-scheme X by the R,- 
schemes X,. Then we can use Lemma 1.1.4 and étale descent to construct a 
sequence of finite morphisms p, : Y, — X, with property P such that Y, = 
Yaa X Raz, fn and Y = ¥o. In this situation and under Condition 1.1.5 we 
can apply Grothendieck’s Existence Theorem (see Section 1.4). This theorem 
shows that there is a finite morphism p : Y — X such that Y¥, = ¥Y xr R, 
for all n > 0. It remains to show that p has property P. Since this depends 
strongly on P, we formulate it as the last condition. 


Condition 1.1.6 If the morphisms p, : ¥;, — X,, all have property P, then 
p:Y¥ — X has property P. 


1.2 Etale localization 


We are going to introduce some terminology related to the étale topology of 
a scheme. All the definitions are restricted to affine étale morphisms. This 
is all we will need and it reduces the technical background. Throughout, X 
denotes a separated scheme. 


152.1 A morphism of schemes y : U — X is called affine étale, if 
U = Spec A is affine, its image y(U) is contained in some affine open subset 
SpecB C X and the ring extension B — A induced by ¢ is étale (see the 
Appendix for a definition of ‘étale’). 


Lemma 1.2.1 Let ¢ : U = SpecA — X be an affine étale map. Then 
for any affine open subset SpecB C X containing ¢(U), B — A is étale. 
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Moreover, if ~’ : U’ = Spec A’ — X is another affine étale map, the same 
is true for the projection of the fibered product U xx U' to X and for any 
morphism U' — U of X-schemes. 


Proof: The first claim follows from the fact that ‘éetale’ is an open condition 
(see the Appendix). For the second claim, take V := Spec BN Spec B’, where 
Spec B C X (resp. Spec B’ C X) contains y(U’) (resp. ¢(U"')). By [14] I. 
Ex. 4.3, V = Spec B” is again affine (here we use that X is separated). By 
the construction of fibred products in [14] II], Thm. 3.3, we get U xy U' = 
(Une *V) xv (U'N(0')*V) = Spec A” with A” = (A@gp B")@pn (BY @pB A). 
The B"-algebra A” is étale by Lemma 3.1.2 (i), proving the second claim. 

If there is any .Y-morphism f : U' — U, then we can take one Spec B C 
X containing both ¢(U) and y’(U"), and then f is given by a B-algebra 
morphism A’ — A, which must be étale by Lemma 5.1.2 (ii). This proves the 
Jast claim. a 


An (affine étale) covering! U/ of X is a family (y; : U; — X)j;¢ of affine 
étale morphisms whose images cover X. 

Given a covering U = (U; — X)jer, we will frequently use the following 
notation. 

C13 tm U; Xx Oy ’ Oi 5k := U; Xx Se xx Ux, pk ET. 
Note that, if the U; are Zariski open subsets of X, then we actually have 
U3; = U;NU;, Uign = U;NU; AU,. We have lots of natural maps: 
=— U; = Vis =- es Xe (3) 
t i) i,k 

The two arrows in the middle we call pi (projection to the first factor, C7) 
and pi) (projection to U’;). On the right hand side, we have three maps hel 
l= 1,2,3, for the projection leaving out the /-th factor. 

By Lemma 1.2.1, Uj; = Spec A;,;, Vij,4 = Spec Aije. Therefore (3) cor- 
responds to a complex of ring morphisms: 


=@ Ap Ss A" =@ Ayy t AN = @ Aji. (4) 
1,7,K 
Lemma 1.2.2 Assume X = SpecA Wie Then: 
(i) Canonically, A” = A’ @, A’ and A" = A'@, A'@4 A’ 
(ii) The natural morphism A — A’ is faithfully flat 
Proof: By the construction of fibered products in the affine case, A;; = 
A, ®4 A, and Ajj, = A; @a A; @4 Ax. This proves (i). A — A’ is etale. 
therefore flat. It is faithfully flat because Spec A’ = , U; ~ X = SpecA is 
surjective ({17] 4.C (iii)). - 


‘note that we give ‘covering’ (recouvrement, Uberdeckung) a different meaning than 
‘cover’ (revetement, Uberlagerung) 


— WEWERS: Deformation of tame admissible covers of curves 


1.2.2 Let k bea field. A geometric point of X is a scheme morphisry 
x: Speck — X such that k is an algebraically closed field. Let x : Speck Be 
X be a geometric point. An (affine étale) neighborhood of z is a Dair 
(C', 2’), where U’ —_X is an affine étale morphism U’ — X and 2’: Speck — pr 
a lift of 2 to U’. Frequently, we will write U instead of (U, xz’) and call it simply 
a neighborhood of 2. Let E1(.Y,x) denote the category of neighborhoods of 
z. A morphism (U',,.r1) — (U2. 22) is a commutative diagram 


Speck = Speck = Speck 
| 7 are ie 
Cy — U, a X 


If a morphism from U, to U2 exists, we will say that LU, is smaller than U, 
Let Ei’(.X,r) be the full subcategory of all connected neighborhoods. 

It follows from Lemma 3.2.1 that Ei’(X,2) is a filtered inverse system. 
Therefore we can define the (strict) henselian local ring of X at z as 


Ox ,"= lim A, U'=SpecA € Ei’(.X,z) (3) 


Of course, for every neighborhood U = Spec A of x, Ox; is the henselization 
of A — k, as defined in the Appendix. We define the (strict) complete 
local ring Ox,, of X at x to be the completion of Ox. 


1.2.3 Local decomposition of finite morphisms Recall that a mor- 
phism f : ¥ — X of schemes is called finite, if for every open affine subset 
U' = Spec A CX the preimage f~!(U’) = Spec B is affine and B is a finite A- 
algebra, i.e. finitely generated as A-module. Then we can write Y = Spec B, 
where B is a finite Ox-algebra on X, i.e. a coherent sheaf with an additional 
algebra structure (see [14], 1].3 and II.Ex. 5.17). 

Let f : ¥ = SpecB — X bea finite morphism and xz : Speck — X a 
geometric point. Since & is algebraically closed, Y x y Speck = Speck” for 
some n > 0. The n idempotents of k" correspond one to one to the lifts 
Yis-++3Yn :opeck —Y of cto Y. 

If U = SpecA — X is an affine étale neighborhood of z, then V := 
¥ xy U = Spec B, where B is a finite A-algebra. By Lemma 5.1.2 (i), 
V = Spec B — ¥ is an affine étale map. 


Lemma 1.2.3 For every sufficiently small connected étale neighborhood U = 
Spec A of x, V =Spec B is a disjoint union of the form 
—— Vi, Vi = Specd&, 


i=1 


such that V; = Spec B, — Y is a connected neighborhood of y;. Moreover. 


Oy, = lim B, @4 A’, 


’ 
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where A’ — k runs over E1'(A — hk). 
Proof: Let A be the henselization of A with respect to A kand B: 
Ba A. Since A — B is finite and A henselian, we obtain a feces es 
B = ©%,B, into local factors. For a sufficiently small A’ — k € Ei’ (A — hk). 
we have B @,4 A’ = 6;B; such that B, = B,@4 A. This proves the first 
claim. Checking the universal property of henselization (Remark 5.2.2) we 
see that B, is the henselization of B with respect to the ring morphisms 
B— k corresponding to y,. Now the second claim follows from the definition 
of henselization. | 


We will say that a neighborhood lU’ with the properties formulated in 
Lemma 1.2.3 decomposes the finite map f. We may also state this as 
follows: by choosing an arbitrarily small affine étale neighborhood U’ of r. 
its inverse image on Y is the disjoint union of arbitrarily small affine étale 
neighborhoods V; of y;. 


1.3. Descent 


A quasi-coherent sheaf M on a scheme X is an Oy-module which is, over ev- 
ery affine open subset U’ = Spec A, represented by an A-module Af. More pre- 
cisely, given an affine open covering (U, = Spec A, C X); of X, M corresponds 
to a family of A;-modules Af; together with isomorphisms AM; @4, Ai, — 
M;®.,A,j verifying a natural cocycle condition (where Spec A, ; = U;x x Uj). 
The theory of descent shows that this is still true if (U; = Spec A, — X); is 
an étale covering, as defined in the last section. Moreover, this also works 
for finite Ox-algebras B. Since the latter correspond to finite morphisms 
p:Y — X, descent is the right tool to glue finite morphisms, as described in 


L123. 


31 Let X be a scheme and M a quasi-coherent sheaf of O y-modules 
over X (see 14], I1.5). Given an affine étale map ¢ : U’ = SpecA — X, the 
inverse image y"M is a quasi-coherent sheaf on U’ = SpecA, so it can be 
written as 


where M is an A-module ({14] I] Proposition 5.4). If py’: U’ = Spec A’ — X 
is another affine étale map and f : U’ — U is an X-morphism, we can 
write (y’)"M = M’ and the relation y’! = yo f induces a natural A’-linear 
isomorphism 


M'S M@, A’ (7) 


Now let U = (y; : U; = Spec A, — X); be an affine étale covering and .M 


a quasi-coherent sheaf on X. Then y7M = Ad, where M; is an A,-module. 
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Using the notation introduced in Section 1.2, the relations ¢; op\) =n o pl?) 
induce a family aw = (Q;,;);,, of A;j-linear isomorphisms 
Qi): AM, @.4, Aj ; —=> vis @a, A (3) 


For each triple z,j,4, the family (a;,)) gives rise to Aj ;,.-linear isomorphism, 


(1) ; ’ ws 
Qs yk : M; @a, Aik is My, @a, Ai jk 
2) > ae 
al?) kK: M; a, Ai.jk =, MM), @ a, Aims (9) 


a” 
Qik: = M;@a,A Ne ce ae M; @a, Ales bes 


(3 
For instance, al?) 4 18 defined by a,;®, @) A, jk, where e; ae : Aij — Arjs is the 


natural morphism and where we have identified (Mi @a, Aij3) @a., Aige with 
M;, @a,; Aine and (M, @4, Aij) @a,, Aizen With M, @4, Aijx. The other two 
cases are similar. A tedious but formal verification shows that the following 
cocycle relation holds for every triple i,j,k: 


of) (3) (2) 
Os ik O Oi jk = Fi jk (10) 


Conversely, let (Af;); be a family of A;-modules and (a;,;);,; a family of A, j- 
linear isomorphisms as in (8). The datum (M,,a,,;) is a ed a descent 
datum on the covering & if the family of isomorphisms a4 , derived from 
the a;,; as in (9) verifies the cocycle relation (10). A morphs ‘sh descent data 
from (Ad;, a,,,) to (Nj, 3;,;) is given by a family of A,-linear maps f, : Af, ~ A, 
compatible with a; and §;,, in the obvious way. Another formal verification 
shows that 


M — (M675) (11) 


defines a functor from the category of quasi-coherent sheaves on X to the 
category of descent data on WU. 


Theorem 1.3.1 (Descent for quasi-coherent sheaves) The functor de- 
fined by (11) is an equivalence of categories. In particular, for every de- 
scent datum (M,,a;;) onU there exists a quasi-coherent sheaf M on X with 
iM = M;. 
Proof: For a proof of this theorem and much more general results, see e.g. (20] 
Chapter VII or [1] Chapter 6.1. We will explain how to reduce the theorem 
to a problem on faithfully flat descent of modules. 

First assume that X = Spec A is affine. Let (M,,a,,;) be a descent datum 
on U. Then Af’ := @,;M; has a natural structure of an A’ = @,A; module. 
Using Lemma 1.2.2 we obtain canonical A’-linear isomorphisms 


M'@4A'= CBM, 844, = OM, ®a, Ay 
t.) t,) (12) 
A'@4M' = €)M,@44,= DM, Oa, A 


t.) tJ 


WEWERS: Deformation of tame admissible covers of curves 249 


Therefore a descent datum (Adj, a;,;) on UW gives rise to a a datum (M’,a). 
where /' is an A’-module and a: M'@,4 A’ — A'@,4 M' is an A” = A'@,4 A’- 
Jinear isomorphism. The cocycle relation (10) corresponds to the relation 


a®@oqg = aq!) (13) 
with A”-linear isomorphisms 


a) : 4'@,A'@,M' — A’'Q@,M'Q, A’ 
a) : 4'@,A'@,M — M'@, AQ, A (14) 


~ 


a!) 7 A @a M’ Oa A’ a Mt’ Oa A’ @a A’ 


obtained by tensoring a on the left, in the middle and on the right with Id4:-. 
Remember that we are assuming X = Spec A and we can therefore identify 
a quasi-coherent sheaf M on X with the A-module M :=T(X,M). In this 
case we have Af’ = @,;M; = M @, A’. and a is defined by a; @ (m @ az) & 
(a, ® m) © aa. We can therefore reformulate the statement of Theorem 1.3.1 
as follows. Given a faithfully flat ring homomorphism A — A’, the functor 


M e- (Mf’ = M @a AX a) (15) 


is an equivalence between the category of A-modules and the category of 
descent data for the morphism A — A’. For a proof of this statement, see 
e.g. [18] I Remark 2.21. 

The general case of Theorem 1.3.1 now follows quite easily. We can cover 
X be affine Zariski open subsets V, = Spec B,. It is easy to see that for 
every j4 we can restrict our descent datum (M;,qa,,;) on U to a descent datum 
on the étale covering U|y, = (U; xx V, — V,) of V, = Spec B,. Applying 
Theorem 1.3.1 in the affine case, we obtain quasi-coherent sheaves M, on \,, 
corresponding to B,-modules M,. Using the fully faithfulness of the functor 
(11) in the affine case we get isomorphisms M,|v,qv, + Mul|v,nv,, because 
both sheaves correspond to the restriction of the descent datum (A/;, a;,;) to 
the affine open subset V, M V,. The uniqueness of these isomorphisms forces 
them to satisfy the usual cocycle relation. By [14] II Exercise 1.22 we can glue 
the M, to a quasi-coherent sheaf M. By construction, M corresponds to 
the descent datum (M;,a;,;). A similar argument shows the fully faithfulness 
of (11) in the general case. z 


1.3.2 Let X and U be as before and let B be a finite Ox-algebra. Regard- 
ing B as a quasi-coherent sheaf on X, we obtain a descent datum (Bj, ai.;). 
where the B, are finite A;-algebras and the a;,; are isomorphisms of A,_,- 
algebras. We say that (B,,a;,;) is a descent datum for finite Ox-algebras on 


U. 


1] 
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’ 


Corollary 1.3.2 The functor B ++ (B;,a,,;) is an equivalence between ihe 
category of finite Ox-algebras and the category of descent data for finite Ox. 
algebras on U. 


Proof: Let (B,,a;;) be a descent datum for finite Ox-algebras on UY. The 
algebra structure of the B; is given by A,-linear multiplication maps m; : 
B, @a, B; — B,. These maps verify certain identities corresponding to thie 
rules for multiplication in a ring. Using the fact that the a;) are algebra 
morphisms, the family (m;); is easily seen to be a morphism of descent data. 
By Theorem 1.3.1 we obtain a quasi-coherent sheaf B on X together with 
a morphism m : Bb ®o, B — B of quasi-coherent sheaves. The morphism 
m verifies the same identities as the m; and defines thus a structure of Ox- 
module on G. It remains to show that B is actually a finite Oy-module. Ip 
view of the construction of B in the proof of Theorem 1.3.1, this follows from 
the following fact. Let Af be an A-module and A — A’ a faithfully flat ring 
extension. Then ‘V is finitely generated over A iff Af’ := M @, A’ is finitely 
generated over A’. | 


1.4 Grothendieck’s Existence Theorem 


Let AR be a complete local ring with maximal ideal m and let X be a scheme 
over R. For n > 0, let R, := R/m"t! and X, := X xp R,. Note that NX, is 
a closed subscheme of X. In particular, X := Xp is the special fiber of X. 

A formal coherent sheaf on X is a family (Mz)n>0 of coherent sheaves 
M, on X, together with a system of isomorphisms 


Yam: Mn@r, Rm —> Mm. n>m (16) 
such that Yim ° (Ynm @r, Ri) = Yn for alln > m > I. For instance, 
given a coherent sheaf M on X, we can define the formal coherent sheaf 
M := (M @rR Ra)n, called the formalization of M. There is an obvious 
notion of morphisms between formal coherent sheaves. Formalization 1s 4 
functor 


M > M:=(M @pr Ra)n (17) 


from coherent to formal coherent sheaves. A formal coherent sheaf M’ on 
X is called algebraizable, if there exists a coherent sheaf M on X with 


M = M. 


Theorem 1.4.1 (Grothendieck’s Existence Theorem) Let R be a com- 
plete noetherian local ring and X a projective scheme over R. Then (17) 35 
an equivalence of categories. In particular, every formal coherent sheaf on X 
ts algebraizable. 
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The proof of this theorem in [7], Chapter 5, uses the cohomology of co- 
herent sheaves. In [14] this cohomological machinery is developed under less 
general hypotheses. Below we give a proof of Theorem 1.4.1 which follows 
closely the original lines of [7] but only relies on results proved in [14]. First 
we state the corollary which we will need to do formal patching. 


Let Y/R be as in Theorem 1.4.1 and let B be a finite Ox-algebra (see 
Section 1.2.3). Regarding B as a coherent sheaf, we define its formalization 
B := (B,). Each B, is a finite Ox,-algebra and the isomorphisms B, @r, 
Rm — Bm tespect the algebra structure. Therefore we call B a formal finite 
Ox-algebra. The following Corollary is implied by Theorem 1.4.1 in the 
same way as Corollary 1.3.2 is implied by Theorem 1.3.1. 


Corollary 1.4.2 Assumption as in Theorem 1.4.1. Every formal finite Ox - 
algebra is uniquely algebrazzable. 


1.4.1 Before passing to the proof of Theorem 1.4.1 we fix some notation 
and recall some general facts. Since X C Pp by assumption, we can consider 
coherent resp. formal coherent sheaves on X as coherent resp. formal coherent 
sheaves on Pp. Therefore, we may assume that X = Pp. 

X = Py has a standard affine open covering (U; = Spec A,):, where A; = 
R{To/T;,... ,T,/T,],7 =0,...,r. Let M be a coherent sheaf on X. For every 
sequence t),... ,2, With 1 <a <<... <1, <r we put Uj, i, = Ui, N.. ANC, 


and Mj, ....i, := PCG... ip» M). Ina standard manner we obtain a complex 


Bu — OM; — BMuar — (18) 


F tJ tgak 


of R-modules (see [14] 11.4). Note that M; = [(Ui,M) (resp. Mi; := 
T(U;;,M) etc.) are finitely generated A,;-modules (resp. finitely generated 
A;,, = A,[T,/T;]-modules etc.). For g > 0 we define the cohomology group 
H4(.X,M) as the q-th cohomology group of the complex (18). By [14] III. 
Thm. 4.5 this coincides with the definition of cohomology via derived functors. 
In particular, H°(X,M) =T(X,M) are the global sections of M. 

Let M’ = (M,)n be a formal coherent sheaf. Note that we can consider 
M,, as a coherent sheaf on X. We will write ag”) = (UM); Mi) = 
T(U;;, Mn) etc. For n > m the natural morphism M, — M,» induces 
morphisms 


Mi") — MP", Mi) — Mi... 09) 


and hence a compatible system of morphisms H7(.X,M,) — H4(X,M~m).- 
We define the cohomology of a formal coherent sheaf by 


H?(X,.M’) := lim H*(¥.M,). (20) 


n 
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The main ingredient of the proof of GET is the Theorem on Formal Function, 
({14] III, Theorem 11.1). It states that for any coherent sheaf M on X and 
gq > 0 we have 


H*(X,M) = lim H°(X,M,) = H%(X,M). (21) 


n 


1.4.2 Let M’ = (M,), be a formal coherent sheaf on X = PR. We 
will say that M’ is generated by a finite number of global sections if there 
are elements m,... ,m, € H°(X,M’) whose images generate the A,-module 
a), for all 1 <2 <r, n> 0. Given a formal coherent sheaf M' and k € 7. 


we define its k-th twist of M’ to be M’(k) := (Mi(k))n (see [14] IL3). 


Proposition 1.4.3 Let M’' be a formal coherent sheaf on X = PR. Then for 
k > 0, M’'(k) is generated by a finite number of global sections. 


Proof: Let R:= @a>om"/m"t! be the graded ring associated to R. Consider 
the sheaf M := @n>om"M,, as a quasi-coherent sheaf on X:= P,. We claim 
that M is a coherent sheaf. In fact, M is generated as an Oy-module by its 
subsheaf Mo, which is a finitely generated Ox,-module. Note that we have 


H*(X,M) = G H(X,m"M,). (22) 


n>0 


By [17] 10.D, & is noetherian. Hence we can apply [14] III, Theorem 
5.2, to conclude that for & >> 0 the coherent sheaf M(k) is generated by 
a finite number of global sections and H4(X,M(k)) = 0 for all g > 0. In 
particular, Mo(k) is generated by a finite number of global sections and 
H'(X,m"M,(k)) = 0 for all n > 0 (use (22)). 

The short exact sequence 0 — m"*!Maii(k) — Magi(k) + M,(k) — 0 


yields an exact sequence 


H°(X,Masi(k)) > AO(X,Malk)) — A(X, m™Mayi(h)) = 0. 
( 


23) 
showing that the first morphism is surjective. Choose a set my,...,m € 
H°(X, Mo(&)) of global generators of Mo(k). Using inductively that the first 
map in (23) is surjective, we can lift them to global sections mj,... ,m € 
H®(.X, M’'(k)). By Nakayama’s Lemma, their images in H°(X,M,(k)) gen- 
erate M,, for all n > 0. This is exactly what we wanted to prove. ial 


1.4.3 Now we are going to prove GET. Let M, N be coherent sheaves 
on X. Then there is a coherent sheaf Hom(M, AN) such that Hom(M,N) = 


’ 
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H°(Hom(M,AN)) ([14] II.5). Formalizing this construction we obtain a for- 
mal sheat Hom(M,N) such that Hom(M,N) = H°(Hom(M,N)). The 


Theorem on Formal Functions (21) implies 
Hom(M,N) = Hom(M,N). (24) 


This proves the fully faithfulness of the formalization functor MH M. 

It remains to show that for any formal coherent sheaf M’ on X we can 
find a coherent sheaf M with M’= M. By Proposition 1.4.3 we can find & 
such that M’(k) is generated by a finite number of global sections. Therefore 
there exists a natural number / and a surjective morphism Of — M'(k) of 
formal coherent sheaves. Twisting with —k we obtain a surjective morphism 
f’ : Ox(-k)! — M’. Applying this procedure once more to the formal 
coherent sheaf Ker(/’), we obtain an exact sequence 


Ox(-k’)" 4 Ox(-k) & mM’ — 0. (25) 


The first two formal sheaves are obviously algebraizable. Hence the fully 
faithfulness of the formalization functor guarantees the existence of a mor- 
phism g : Ox(—k’)" — Ox(—k)! of coherent sheaves such that g' = g. Let 
M be the cokernel of g. Then M @rp R, = M, because both sides are the 
cokernel of g,. Hence M’ = M. a 


2 Tame admissible covers 


Tame admissible covers are finite morphism between relative nodal curves 
with a particular local ramification behavior. Our goal is to show that locally 
in the éetale topology all tame admissible covers have a certain standard shape. 
This is done in Section 2.3. Section 2.1 and 2.2 contain some preliminary 
results. 

Section 2.1 gives a detailed study of the strict complete local ring of a 
curve at an ordinary double point. The deformation theory of a curve in a 
neighborhood of an ordinary double point depends on the way one can choose 
and lift the so called formal coordinate systems. Since this will turn out to 
be crucial for the deformation theory of tame admissible covers, we do it 
very carefully. Note however that under the assumption of a reduced base 
ring (which suffices for most applications), proofs would become substantially 
simpler. 

In Section 2.2 we use the results of 2.1 to study étale neighborhoods 
of ordinary double points. Technically, this means to compare the strict 
complete local ring of such a point to the strict henselian local ring. We also 
give the analogous statements for smooth points, which are much easier to 
prove. 
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2.1 Formal double points 


Let A and R be complete noetherian local rings. Denote their maximal ideals 
by mq # and 9% q A and the residue field R/m by k. Let R — A be “5 
faithfully flat local ring extension. 

The ring A is called a formal double point over RF if there exists a Pair 
u,v of elements of A such that 


(i) t:=uvem, 
(ii) u,v induce an isomorphism A = R[[u, vjuv = t]] of R-algebras. 


In this case, the pair (u,v) is called a (formal) coordinate system for A/R. 
Any element f € A can be written uniquely in the form 


feat oaiui't S aly’ (26) 


i>0 i>0 
with do, a},a € R. We will call (26) the (u,v)-expansion of f. 


Proposition 2.1.1 Let A, R be as in the first paragraph of this section. As- 
sume that A:= A/mA is a formal double point over k. Then: 


(i) Every coordinate system (u,v) of A/k lifts to a coordinate system (u,v) 
of A/R. In particular, A/R is a formal double point. 


(ii) Every pairr,s € A with rs € m such that 
a, mS 
is a formal coordinate system. 


Proof: Let (u,%) be a coordinate system for A. Note that for every pair u,v € 
A lifting a, 0 the induced ring homomorphism R[[u, v]] > A is surjective, since 
it is surjective mod m. We have to find lifts u,v such that uv € R. To do 
this, we will construct inductively a sequence of lifts u,,vn € A of &,0 and 
a sequence of elements t, € m (for n > 0) such that uy, = Uaii, Un = Uns 
mod m"t!4,t, =ta4, mod m"t! and u,v, =t, mod m"t!, 

To start, take any pair uo, vo lifting u,v and let to := 0. Suppose we have 
already constructed un,Un,tn for some n > 0. Then we can write 


Un’, =t, + ) a; u,v), 
1,720 
=} I #9 d n+2 4 
ln + doo + 41,0 U, + d9;U, mod m 


z7>0 2>0 


with a,, € m™t!. For the first equality we have used the fact that an element 
of m"*' 4 can be written as a power series in u,,v,_ with coefficients in m"t!. 
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For the congruence we have used u,v, € MA. Let upg := Un — >> ao,; v2? 
J>0 
and Un+1 (= Un — Dl ajiout,'. Using (27) we obtain 
i>0 
Unsilng1 = Untn — 0 d,0U, — D> ao, v2 mod mt? 4 
2>0 J>0 (28) 
zt, + doo mod m"*? 4. 


Hence the induction step is done if we define ty4; := t,p+doo. Let u:= limu,. 
yp: lime, and t:=limt,. Then u,v lift u,v and uv =t € m. 

The following argument is taken from [6], Lemma 2.2. Consider the short 
exact sequence of R-modules 


0 —+ I — Riu, vjuv =t]] —~+ A — 0. (29) 


The homomorphism on the right is the one induced from the above choice 
of u,v € A. Since A is R-flat, (29) remains exact after reduction modulo m 
(see [16] XVI, Lem. 3.3). But &{[z,v|uv = 0]] — A/mA is an isomorphism by 
hypothesis, hence J = mJ. Since R[[u, v|uv = t]] is noetherian, the ideal J is 
finitely generated and Nakayama's lemma implies J = 0. This proves (i). 

Let r,s be as in Statement (ii) of the proposition and let (u,v) be some 
coordinate system for A/k. Denote by (F,8) the reduction of (r,s) to A. 
Write 


(30) 


with coefficients in k. Using rs = uv = 0 one finds step by step that the 
following holds: first, agb9 = 0, next dg = bo = O and finally that either 
ai = 6’ = 0 or a! = & = 0 for all 1 > 0 (we may assume the latter). 
By assumption the maximal ideal of A is generated by 7,5. This can only 
happen if ai, b/ 4 0. Therefore the power series in (30) are in one variable 
and invertible. This proves that (r,3) is a coordinate system for A/k. The 
argument used at the end of the proof of (i) shows that (r,s) is a coordinate 
system for A/R. = 


Proposition 2.1.2 Let A/R be a formal double point. Then the set of ideals 
{uA, vA} and the ideal tR (with t := uv) are the same for any choice of a 
coordinate system (u,v). 


Proof: First, let us assume that there exists a coordinate system (u,v) of A/R 
with wo = 0. Under this assumption, we will prove that for every coordinate 
system (r,s) of A/R, we have rs = 0 and either rA = uA and sA = vA or 
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rd=vAand sA=uA. Write 


/ 
r=dot+ ) au +5 are, 


t>0 1>0 (31 
$= bo + a bru’ + be brut. rd 
7>0 1>0 


Since (r,s) is a coordinate system, the (u,v)-expansion of rs consists only of 
a constant term. Computing the (u, v)-expansion of rs using (31) and uv = 9. 
one obtains 


n-1 
dobi, + 3 abe; tab = 0, 
t=] ‘ 
be (32) 
aby +S alb"_; + aby = 0, 
t=1 


for all n > 0. Reducing (31) mod m and applying the arguments from the 
proof of Proposition 2.1.1 (ii) (following (30)) we can assume aj,b/ #¥ 0 
mod m and a/,6, =0 mod m. For n = 1, (32) states 


abi a a, bo = () ’ agbt + ay bo = 0 (33) 


Rewrite this as bp = —(a,)~'agb, and ag = —(b/)~1a/bo. Plugging the second 
equation into the first yields bp(1 — (a)b/)~*a/b,) = 0. But the second factor 
is congruent to 1 mod m, hence bg = 0. By symmetry we obtain aj = 0. In 
particular, this proves rs = 0. 

Next we prove by induction that a?,b! = 0 for all z > 0. Assume that 
this is true for all 2 < N for some N > 0. Then (32) with n := N +1 states 
a, by = 0 and aby = 0, therefore 64, = ak, = 0. We have shown that r (resp. 
s) Is a power series in u (resp. v) starting with an invertible coefficient for the 
first power. This proves rA = uA and sA = vA and hence the Proposition in 
this special case. 

The general case follows easily. Let (u,v) be a coordinate system and 


t:= uv. The ideal J := tR is the minimal one such that uv = 0 for every 
coordinate system (u,v) of A/IA over R/I. The ideals uA and vA are the 
inverse images of uA/tA and vA/tA. a 


Let A/R be a formal double point, (u,v) a coordinate system and n a 
natural number. Define 
Pun i= Anny(u"A) = {ae A] au" =0} 4 A, 


34) 
Pin = Anna(v"A) = {a € A| av" =0}.4 A, \ 


and P, := Pui, P, := Py. Using the uniqueness of the (u,v)-expansion. a 


. 
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straightforward verification shows: 


Pun = {> ay! fat! =0 (i= 1,...,n-1), at" =0 (i> n)} CvA. 
1=] 


Po {So ayu' at? =0( =1,....n-1), at”? =0(0>n)} CUA, 
z=1 
where ¢:= uv € m. From this we see immediately that 
Pie ‘a hee = 1 oe ; Pos — (0) (36) 
and (inside A) 
i (Pain + Pon) = (0). (37) 


Proposition 2.1.3 (compare with [19] §9.7 and §3.8) Let (u,v) and (u’,v’) 
be coordinate systems of A/R and n > 1 such that uA = u’A. There are 
unique units a,b € A* with 

(i)? =cue (0 = bo abe R* 
Proof: Put ¢ := uv and t’ := u’v’.. Consider the case n = 1 first. By 
Proposition 2.1.2 there are units a’, b’ € A* with u’ = a’u, v' = bv. Let 


Co o 
ab scot Diguit Didiu', cc.cl € FR. (38) 
2=1 7=1 


be the (u, v)-expansion of a’b’. Using its uniqueness and t' = u'v’ = (a’b')t Ee R 
we conclude ct = c/t = 0 for every 1 > 1. Therefore (35) tells us that the 
sum in (38) is of the form 

ab=eotata, mEeR,y €P, aM EP, (39) 


which is unique by (36) and (37). 
Every pair (a, 6) of units of A with u’ = au, v’ = bv is of the form 


a=a+A,b=U+p, AEP, MER, (40) 
By (36) we have \y = 0, so using (39) and (40), we get the decomposition 
ab=a'bh+hA+a'p=egt (a +A) +(e 4+a’d) (41) 
in R+P,+ P,. Since it is unique, ab € R is equivalent to 
A=-(H)'e,, w= —(a')ley. (42) 


This proofs at the same time the existence and the uniqueness in the case 
ie a 

For general n, deduce the existence of a,b € A* with (u’)" = au”, (v')" = 
bv" and ab € R by using the case n = 1 and taking n-th powers. Any 
other pair a’, b’ € A* with the same properties can be written as a’ = a+ A, 
bo = b+, with \ € P,» and «x € Py». As above, one gets the unique 
decomposition a’b' = ab+ bA + ap € R+ Pin + Pyn, and one can deduce 
A = #t = 0, which proves the uniqueness of a, 6. a 
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2.2. Marked nodal curves 


2:2.1 Let R bea noetherian ring. A curve over R is a scheme X which js 
flat and of finite presentation over R such that all geometric fibers of X/R are 
reduced curves. A nodal curve is a curve X/R such that all geometric fibers 
have at most ordinary double points as singularities. We write XS" ¢ ag 
for the smooth locus of the morphism X — Spec R. A mark on a nodal 
curve X/S is a closed subscheme D C X°™ such that the natural morphism 
D — Spec RP is finite etale. Hence D = Spec R’ for a finite étale R-algebra 
R’. We call the pair (X/R,D) a marked nodal curve. 


Remark 2.2.1 Being a marked nodal curve is local in the étale topology. 
More precisely, let X be a scheme over a noetherian ring R, D C X a closed 
subscheme and (U; — X), an étale covering. Then (X/R,D) is a marked 
nodal curve if and only if (U;/R,D xx U;) is a marked nodal curve, for all j. 


252.2 Let X/R bea nodal curve, ka field and z : Speck — X a geometric 
point whose image is a closed point of X. Let X C X be the fiber of the 
morphism X —+ Spec FR containing the image of z. The composition of z 
with the morphism X — Spec A corresponds to a ring morphism R — k. Let 
R:= Ospec R,s (TeSP. R:= Ospecr,s) be the strict henselization (resp. strict 
completion) of R with respect to k. 

In this subsection we assume in addition that the image of z is a singular 
point of X. Since X/R is nodal we have Ox , = k[[ti,d|av = OJ]. We will say 
that z is a geometric double point of X/R. 


Proposition 2.2.2 Let X/R be as above and x : Speck — X a geometric 
double point of X/R. Then the complete local ring Ox, is a formal double 
point over R (see Section 2.1). 


Proof: By the definition of a curve, there is a Zariski open subset U = 
Spec A C X containing the image of z such that A is flat and of finite presen- 
tation over R. By Proposition 5.2.3 (i) and (ii) the complete local ring Ox.z 
is a local flat R-algebra. Let m:= Ker(R — k). Since U := Spec(A/mA) is 
an open subset of X containing the image of x, Proposition 5.2.3 (iii) shows 
that Oy, = Ox-/mOy,,. Hence the Proposition follows from the definition 
of geometric double points and Proposition 2.1.1 (i). a 

If x : Speck — X is a rational double point, Proposition 2.2.2 shows 
that there exists a pair u,v of elements of Ox, such that t := uv € R 
and Ox, = Rfu,vjuv = tl). Such a pair (u,v) will be called a formal 
coordinate system for X/R at r. Note that the ring extensions R — Ff 
and Ox.. + Ox. are faithfully flat, in particular injective (Proposition 3.2.3 
(11)). We will call a pair u,v of elements of Oxy, a coordinate system for 
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X/R at x ift:= uv € Rand (u,v) (regarded as a pair of elements of Ox -) 
is a formal coordinate system. 


Proposition 2.2.3 Let X/R be as before and x : Speck —- X a geometric 
double point. There exists a coordinate system (u,v) for X/R atx. Let 
(u,v) be any coordinate system for X/R at x and R' € E1(R — k) such that 
t:= uv € R’. Then for every sufficiently small affine étale neighborhood 
Ui = Spec A — X of'x, the natural morphism R'[u,v|uv = t] — A is étale. 


Approximation Theorem. We will not give the details of the argument, be- 
cause a much more general statement is proved in [3] XV, Corollaire 1.3.2. 
For the second assertion, let (u,v) be any coordinate system for X/R at x. 
By definition, t := uv € R, hence we can find R’ € Ei(R — k) with t € R’. 
Let U = SpecA — X be a neighborhood of z. Replacing A by A @p R’ 
we may assume that A is an R’-algebra. By the definition of Ox, we may 
assume that u,v € A. We obtain a natural morphism R’[u,vluv = t] — A. 
Since l’ = Spec A — X is a neighborhood of x, A is equipped with a nat- 
ural morphism A — k; the composition R’{u,vjuv = t] — k sends u,v,t to 
0. Taking the completion of the rings R’{u,v|wv = t] and A with respect 
to the morphism to k, we obtain an isomorphism R[[u,v|uv = ¢]] = Ox. 
Hence, by Proposition 5.2.3 (v) the morphism R’[u, vjue = t] — A is étale in 
a neighborhood of the maximal ideal Ker(A — k). Replacing U = Spec A bv 
a Zariski open neighborhood of this point completes the proof of the Propo- 
sition. be 


Given a coordinate system (u,v) of a geometric double point on X/R. a 
pair (R’,U = SpecA — X) as in Proposition 2.2.3 will be called a coordi- 
nate neighborhood for (u,v). Frequently we will omit the ring R’ from our 
notation. 


2.2.3 Let A, X and x: Speck — X be as in the first paragraph of 2.2.2. 
Now we assume that x is a geometric smooth point on X/R. By this we mean 
that the image of x is a smooth point of the fibre X C X on which it lies. In 
addition, let D C X be a mark on the nodal curve X/R. For an affine étale 
neighborhood U = SpecA — X of a, let Dy := D xx U. Then (U, Dy) is 
again a marked nodal curve. Since U = Spec A is affine, Dy = Spec(A/J) 
for an ideal J q A ({14] II, Corollary 5.10). By definition, R — A/J is finite 
étale. There are two cases to consider. First, if the image of z lies on X — D. 
then for any sufficiently small neighborhood U’ = Spec A — X of z we have 
I= Aand Dy =@. On the other hand, if x lies on D. Proposition 5.2.3 (iii) 
and (v) imply 


R=Opz=Oxe2/l10x 2, R=Opz = Oxz/[lOx,s- (43) 


—— OO ———Eeee ~~ 
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Proposition 2.2.4 Let x: Speck — X be a geometric smooth point on the 
marked nodal curve (X/R,D). There erists an element z € Ox,2 such tha; 
for every sufficiently small affine étale neighborhood U = Spec A — X of r 
the natural morphism Riz] — A is étale. Moreover, if x lies on D, we can 
choose z such that Dy is defined by the equation z = 0. 


Proof: We will assume that z lies on D. Since x is a smooth point of ¥ 
we have Oy, = A{[Z]]. Similarly to what we did in the proof of Proposition 
is 
any litt of = to Ox». Let U = SpecA — X be a neighborhood of zx ang 
Iq A such that D xx U = Spec(A/I). Let I := IOx,. By (43) we have 
Ri{z]|/1 = R. Therefore z—a € I for some a € thd R. We may replace = by 
z +a, hence z € I. Now Nakayama’s Lemma implies [ = zOx 7. Using [17] 
24.E (i) we conclude that IOx 2 = 2'Ox,. Actually, we may assume that 
z= 2' € Ox,,. After shrinking U’ = Spec A we may assume that z € A and 
I! = zA. Completing the natural morphism R[z] — A with respect to k we 
obtain an isomorphism Ri[[2z]] — Ox... Hence, after shrinking U = SpecA a 
little bit more, R[z] — A will be étale. | 

If x lies on D, an element z € Ox, with Ox, = Rife] and [ = SOu d 
will be called a formal coordinate for D at x. If z is moreover an element 
of Ox., it will be called a coordinate for D at z. 


2.3. Tame admissible covers 


2.3.1 Let (X/R,D) be a marked nodal curve over a noetherian ring R 
(see Section 2.2) and p: Y —+ X a finite morphism of schemes. Moreover, let 
y : Speck — Y bea closed geometric point and z := poy. Since p is finite. 
it induces finite local ring extensions 


Ox2 —> Ovy, Oxz — Ovy. CH 


As in Section 2.2, R (resp. R) denotes the strict henselization (resp. the strict 
completion) of A with respect to k. The first (resp. second) arrow in (44) is 
a morphism of R-algebras (resp. R-algebras). 


Definition 2.3.1 (see [13] §4) Let p, x and y be as above. We say that 
p:Y — X is tame admissible at y, if the following holds. 


(1) Assume that x is a geometric double point. Then Oy, is a formal double 
point over R. Moreover, there exist a formal coordinate system (u,v) of 
Ox,z, @ formal coordinate system (r,s) of Oy, and an integer n primé 
to the characteristic of k such that (44) sends u tor” and v to s". 


(1i) Assume that x is a smooth point lying on D. Then there exist a formal 
coordinate z € Ox, for D, an element w € Oy, and an integer n prime 
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to the characteristic of k such that Oy, = R[[w]] and (44) sends z to 
ao". 


(iii) If x is a smooth point not lying on D then Oy, = Ox. 


The morphism p is called a tame admissible cover of (X/R.D) if it is 
tame admissible at every geometric point y of Y. 


Part (ii) and (iii) of Definition 2.3.1 are usually stated as follows: the mor- 
phism p is étale over X*™ — D and tamely ramified along D. Let 


Rev2(X):={p:¥—X} 
be the category whose objects are tame admissible covers of (X/R,D) and 
whose morphisms between objects ¥ — X and Z — X are morphisms of 


schemes } —+ Z compatible with the maps to_X. 


2.3.2 Consider the following situation. Let (X/R, D) be a marked nodal 
curve and p: Y — X a finite morphism. Let y : Speck — Y be a geometric 
closed point such that x := poy is a geometric double point of X and such 
that p is admissible at y. We want to study p in a neighborhood of y. Let 
U = Spec A — X be a neighborhood of x which decomposes p (see Section 
1.2.3). Then there is a unique component V = Spec B of U xx Y which is a 
neighborhood of y. We will refer to V as the local inverse image of U at y. 


Proposition 2.3.2 Let (u,v) be a coordinate system at x. If U = Spec A — 
X is a sufficiently small coordinate neighborhood for (u,v) then 


Bie Ars | 7 1.3" S073 = 7, 


where V = Spec B is the local inverse image of U at y, n an integer prime to 
the characteristic of k andt € ROA. In particular, Y/R is a nodal curve in 
a neighborhood of y and (r,s) is a coordinate system at y. 


Proof: Let A:= Cy,, B:= Oy i A:= Ox. and B:= Oy,. By hypothesis 
we have u,v € A, t:= uw € Rand A = Ri[u,vjuv = t]]. By Definition 
2.3.1 there exist formal coordinate systems (u',v’) for A and (r’,s’) for B 
such that (r’)" = u’ and (s’)" = v’. From Proposition 2.1.2 we know that 
there exist unique elements a,b € A* with u = au’, v = bv’ and ab € R*. 
By Hensel’s Lemma we can choose c,d € A* with c? =a and d* =}. Then 
(cd)" = abe R*, therefore we have cd € R*, by Hensel's Lemma. Let 
Sse aa 6 2=-de's Then t= a 3 =vandr:=rse€ &. But r*,s" € B 
and rT” € R. hence by pe ere 3.2.3 (vi) we have r,s € Bandreé R. 

Let B, := Alr, gle? U8" ees: = T|y Binge A — B, is a finite 
local extension. B, is henselian. There is a natural local morphism B, — B. 
Taking completions at both sides we obtain an isomorphism, because the 
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completion of B, is easily seen to be isomorphic to B= Rilr, s [rs = TI). 
With Proposition 3.2.3 (v) we conclude B = B,. Therefore, if U = Spec .4 
is sufficiently small, V = Spec B is as stated in the Proposition. The other 
statements follow easily. a 


We need a slight strengthening of Proposition 2.3.2. Let ¥ C ¥ be the 
fiber of Y/R on which y lies. Then Oy, = Oy,/mOy.,, where m is the 
maximal ideal of R (compare with the proof of Proposition 2.2.2). Let (u, v) 
be a coordinate system for V/A at x. Write u,v for the image of u,v in 


On = OvefmO vz: 


Proposition 2.3.3 Notation as above. Let (7,3) be a coordinate system for 
¥ at y such that r® = u and 8" =. Then there is a unique coordinate system 
(r,s) for Y at y lifting (7,3) such that r® = u and s" = v. 


Proof: By Proposition 2.3.2 we can choose a coordinate system (r’,s’) for 
Y at y with (r’)" = u and (s’)" = v. Then the coordinate system (r,s) we 
are looking for is of the form r = ar’, s = bs’ with a,b € OF, and ab € RX. 
Then r™ = u and s” = v is equivalent to a(r’)” = (r’)" and 6"(s')” = (s’)?. 
By Proposition 2.1.3 this is equivalent to a* = 1 and 6" = 1. Applying 
Proposition 2.1.3 to Oy, we find that (r,s) is a lift of (7,3) if and only if the 
reductions a,6 of a,b are uniquely determined n-th roots of unity. Now the 
Proposition follows from Hensel’s Lemma. a 


2.3.3 We continue with the notation fixed in the first paragraph of 2.3.3, 
but this time under the assumption that x is a smooth point of X/R lying on 
Dc X. In analogy to Proposition 2.3.2 and Proposition 2.3.3 we can show 
the following. 


Proposition 2.3.4 Let < be a coordinate for D at x. If U = SpecA — X 
is a sufficiently small coordinate neighborhood for z and V = Spec B its local 
inverse image at y. then 


B= Alw |." Ss]. 


Here n is an integer prime to the characteristic of k. As an element of Oyy: 
w is uniquely determined by z and the image of w in Oy, = Oy,/mOy,. 


3 Deformation theory 


This section contains the main deformation result. In 3.1 we give the nec 
essary notation and the precise statement (Theorem 3.1.1). Moreover, we 
sketch how this theorem can be generalized to the case of a non algebraically 
closed residue field, using Galois descent. 
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3.1 Statement of the main result 


B.1.1 Let R be a noetherian complete local ring with algebraically closed 
residue field k = R/m. Let (X/R.D) be a marked nodal curve where X’ is 
projective over R. We write X := X xpk for the special fiber and D := 
i) x R eA 

Let p: ¥Y — X € RerP(X) be a tame admissible cover. A deforma- 
tion of p to Ff is a pair (p,A), where p: Y — X¥ € Rev2(X) is a tame 
admissible cover and A: ¥ xp k — Y is an isomorphism in the category 
Rev? (X). An isomorphism from one deformation (p,,A,) to another (2, A2) 
is an isomorphism f : ¥; — ¥2 with pjo f = p; and A, 0 (f x Idg) = A). Let 


Def;(R) = { (p.A) }/ (45) 


be the set of isomorphism classes of deformations of p to R. Most of the time 
we will identify the special fiber of a deformation with Y,, i.e. we will assume 


A = Idy and write p € Def,;(R). 


3.1.2 Before stating the main theorem we have to fix some notations. 
The special fiber X has a finite number of ordinary double points, which 
we denote by 7,... ,2, € X (they can be identified with the corresponding 
geometric points z; : Speck — X). Let I’ := {1,...,r}. By Proposition 
2.2.2 we can choose coordinate systems (u,,v;) for X at z,,i € J’. Recall that 
u,,v, are elements of the strict henselian local ring Ox .,, such that t; := u,v, 
is an element of R and Oy,, = Rifu;.v,|ujv; = t,]]. We write w;,0; for the 
image of u,,v, in Ox, = Ox2,/mMOx,,. Then (t,,%,) is a coordinate system 
for X at z,. 

Fix « € I’ for a moment and let p-'(z;) = {y, € Y | j € J!} be the 
fiber of p over z,, indexed by a finite set J’. By Proposition 2.3.2 there exist 
coordinate systems (7;.8,) for Y at y, such that te aus and @2 = 0, for 
integers n, prime to the characteristic of k, for all j} € J’. Let J’ be the 
disjoint union of the sets J! for i € I’ and let « : J‘ — I’ be the map sending 
j € JE to i. 

Let p € Def,(R) be a deformation of p to R. For j € J‘ andi := k(J), 
Proposition 2.3.3 shows that there is a unique lift (r,,s;) of (7),3;) to a 
coordinate system of Y at y, such that oe =@, antl 8? = v;. Note that 
T, := 7,8, 1s an element of R with oi = t,. This defines a map 

Def;(R) — fe s (46) 


(p, A) ee (7 = 8, piers 


where T(R) := {(t))er | € R, Te = t,))}. An element (7,), of T(R) will 
be called a deformation datum. Note that the map (46) depends on the 
choice of the coordinate systems (u,,v,) and (r,.s,). Using the assumptions 
and notations of 3.1.1 and 3.1.2, we can now state the main theorem. 
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Theorem 3.1.1 The map (46) is bijective. Aloreover, if tuo deformations 
of p are isomorphic, then this isomorphism is unique. 


The proof of this theorem will be given in Section 3.2, following the Outline 
given in Section 1.1. 


3.1.3 In the statement of Theorem 3.1.1 we have assumed that the residye 
field k& of A is algebraically closed. This is more than we really need. A carefy] 
inspection of the proof of Theorem 3.1.1 shows that we only have to assume 
that the singularities of .V and the points on D = D xgk are k-rational 
and that & contains enough roots of unity. For instance, this holds if & js 
separably closed. But one can prove much more. 

Suppose F is a complete noetherian local ring with arbitrary residue field 
k. Let k* be a sepatable closure of k and let R be the strict completion of R 
with respect to k*. Let (X/R,D) be a projective marked nodal curve. Write 
X := X xpk for the special fiber and X := X xp AR. Let p:Y¥ 4 (X,D) be 
a tame admissible cover and p* its base change to k*. Base change from R to 
R induces a map 


- 


Def,(R) —> Def,-(R). (47) 


Let G := Gal(h?/k) be the Galois group of k. Note that the action of G on 
k* extends naturally to an action on R. 


a 


Remark 3.1.2 There is a natural action of G on Def;:(R). The map (47) 
induces a bijection Def,(R) = Def,(R)°%. In particular, if p is unramified 
over the singular points then there is a unique deformation p of p to R. 


The first two statement of this remark are a variant of Weil’s Descent 
Criterion and can be proved using étale descent. The third statement follows 
from the first two and from Theorem 3.1.1. Note that Theorem 3.1.1 can be 
applied to X/R. Hence we have a bijection Def;»(R) = 7T(R), where T(R) is 


a 


defined as in (46) in terms of coordinate systems of X at the singular points. 
The induced G-action on T(&) can be determined from the natural action of 
G on the strict complete local rings of the singular points of Y. In particular. 
if Y/R is smooth or if p is unramified over the singular points, T(R) has 
exactly one element. 

As remarked in the introduction, the results of [9] about deformation of 
mock covers with tame ramification can be reformulated in terms of tame 
admissible covers which are unramified over the singular points. Therefore. 
by Remark 3.1.2, Theorem 3.1.1 implies the results of [9]. However, the more 
general results of [10] on mock covers with wild ramification do not follow 


from Theorem 3.1.1. 
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3.1.4 Suppose that X is a smooth projective curve over a complete 
noetherian local ring A. and let D C X be a mark. We will call tame 
admissible covers of (.X¥, D) tamely ramified covers. 


Corollary 3.1.3 The functor 
Rev (XY) = Rev? (X) 


reducing tamely ramified covers of X to the special fiber X := X xpk is an 
equivalence of categories. 


Proof: In the case of smooth curves, Theorem 3.1.1 and Remark 3.1.2 show 
that tamely ramified covers lift uniquely. To prove Corollary 3.1.3, it remains 
to show the following. Let p: Y — (X,D) and 7: Z — (X,D) be tamely 
ramified covers. Denote their reductions to X by p and #7. Let f: Z —Y be 
an X-morphism. Then f lifts uniquely to an X-morphism f : Z > Y. 
Using the result of Section 2.3, it is easy to see that C := p-'(D) CY 
is a mark and f a tame admissible cover of (Y,C). We already know that f 
lifts to a tame admissible cover f : Z' — Y. It follows that po f: Z'’— X 
is a tame admissible cover. Moreover, po f is a lift of 7. But since lifting is 
unique we may assume that Z'’ = Z and po f =7. a 


In Section 4 we will use Corollary 3.1.3 to construct a specialization mor- 
phism for tame fundamental groups. 


3.2 Proof of the main theorem 


In this Section we give a proof of Theorem 3.1.1 following the outline given in 
Section 1.1. In 3.2.1 we choose an étale covering of the curve X and fix some 
notation. In 3.2.2 we prove the theorem in the special case of an artinian 
base ring. In Section 3.2.3 we prove the general case by successively lifting p 
to the artinian rings R, := R/m"+t' and applying Grothendieck’s Existence 
Theorem. 


eae Notations and assumptions are as in Section 3.1.1 and 3.1.2. In 
addition, we assume that A is artinian. This means that m¥ = 0 for some 
integer V > 0. The scheme Spec R consists of a single point and the closed 
embedding .Y C X induces a homeomorphism of the underlying topological 
spaces. Recall from 3.1.2 that 2,;,7 € J’, are the singular points of X, that y,. 
j € J’, are the singular points of Y and that the map «: J’ — I’ is defined 
by p(y)) = ei). We have chosen coordinate systems (u,,v;) (resp. (7;,3;)) 
for X at z,,72 € I’, (resp. for Y at y,, 7 € J’,) such that 7? = u,(,) and 
5; = Ux(j)- 

The closed subscheme D C X consists of a finite set of smooth points 
Teptyire 5 %y © A. Let I” := {r+1,...,9}. For? € I” lev gee) = ty; © Y | 
j € J"} be the fiber over z,, indexed by J”. Let J” := User Jl", I= UI". 
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J := J'UJ” and extend « toamapa: J — J with x(j) =7 for 7 € J". For aly 
2 € I", choose a coordinate =, € Ox, for D at x, (Proposition 2.2.4). For al} 
j) € J”, choose an element w, € Oy, such that Or, = Ox, [w,|w™ = = 
as in Proposition 2.3.4. ae oe pal | 

For 2 € J, let U, = Spec A, — X be an affine etale neighborhood of z,. We 
may assume that U; is a coordinate neighborhood, for the coordinate system 
(u;.v;) if 7 € J’ and for the coordinate z; of D if 1 € I”. Hence, we have 
étale ring morphisms R[w;,2; | uv, = ti] — A; for i € I’ and Riz,| — A, for 
~E€ 1". Let Uo := NX -—{z, |i E72} Cc X and Ip := TU {0}. We may assume 
that Up = Spec Ao is affine. Then U := (U;)ics is an affine étale covering 
of X. In the sequel we will keep the open subset Uo fixed and continue to 
shrink the neighborhoods U;, 7 € J, as necessary. Then & will always remain 
a covering of ¥. Hence we may assume that for all z € J the image of U;, 
on .X is contained in Uy U {z;}. This implies that for 7 # 7 the image of 
U;yi= U; Xx UZ on X is: contained,im Up. 

Let U := (U; = U; xx X )ieh be the restriction of U to X. Fori € J’ 
(resp. 2 € I”), U; := Spec A, = X is a coordinate neighborhood for (i,, i) 
(resp. for Z,), where A, = A;/mA,. Since p: Y — X is a finite map, we can 
write ¥ = Spec B for a finite Oy-algebra B. Let @ = (C;, a,,;) be the descent 
datum for B on U (Section 1.3). In particular, C; is a finite A,-algebra such 
that U; xx ¥ = SpecC;. Note that the neighborhoods U; of 2; can be made 
arbitrarily small by choosing U; small. We may therefore assume that we 
have C; = Bx(j) _;B, , where 

B, = Ajft;,3 (|Fe = ta 5 = Te. = 0] (48) 


m1’ J 


for j) € J! and 


B, = A, [ay | ey = "SE (49) 


for j € J” (see Lemma 1.2.3, Proposition 2.3.2 and Proposition 2.3.4). 


3.2.2 Let p : Y — X be a deformation of p to R. We can write Y = 
Spec B for a finite Ox-algebra B with B= B@rk. Let a = (Cj, a;,;) be the 
descent datum for B on U. It follows that @ =a @rk. By the Propositions 
2.3.3 and 2.3.4 ere are unique lifts (r,, $i) of oe (resp. w, of w,) with 
ps =u,j) ands = ug), J € J’ (resp. wi” = 24(5), 7 € J”). Moreover, we 
may assume that C; = xj): 8B), where 


n nr 
B, = Aj[r),3; | ty’ = vi 3; 


j=, 7,8) = 7, iF T,:=7,8,;ER (50) 


for 7 € Ji, 


B; = As, ws? = 2°) (51) 
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for j € J" and B, = B, @g fk for all 7 € J. Since p is étale over Up, Co is 
an étale Ag-algebra. For the same reason and our choice of the covering WU. 
C; @a, Ai, and C, @a, Ai, are étale A, ,-algebras, for 2,7 € 1,2 #7. 

Now let p’ iy “ —_. XN be another deformation of p to R inducing the 
same defsemation data (7)); as p. Let ¥’ = Spec B’ and a’ = (Cj, a;,;) be 
the corresponding descent datum for B’ on UW. Since Co and CG are dtale Ao- 
algebras with Co@pk = Co = Co@pk. there is a unique Ag-linear isomorphism 
fo: Co — Cg with f @gk = Idc, (Lemma 5.1.1). After shrinking the 
neighborhoods U;, 7 € I, we find A,-linear isomorphisms f; : C; — Ci with 
fi@pk = Ide, because (48) and (49) depend, up to unique isomorphism, only 
on the deformation data (7;);. Moreover, the f, are uniquely determined by 
pee values on the coordinates r,, 8; and w;. It follows from the Propositions 

2.3.3 and 2.3.4 that the f; are unique. We claim that the family (f,) is an 
‘somprpiient of descent data between a and a’. This follows from the fact 
that C;@.4, A), C) @a, Ary, Ci @a, Ar and C}@a4, Ai; are étale A, ;-algebras 
and from Lemma 3.1.1. By Corollary 1.3.2 ne family (f,) descends to a 
unique isomorphism f : ¥’ — Y between the deformations p’ and p. This 
proves the second statement of Theorem 3.1.1 and the injectivity of the map 
(46) in our special case. 

To prove the surjectivity of the map (46), let (7))) € T(R). Forz € J, 
define a finite A;-algebra C; = $,(j)=:B, by the expression given in (50) 
and (31). By Lemma 3.1.1 we can lift Co to a finite étale Ao-algebra Co. 
Using our rae on the covering U, it is easy to see that for allz 
j, 1,7 € Ip, C, @a, Aij and Cj @a, Aj are étale A;j-algebras. Therefore 
we can apply Lemma 5.1.1 once more to construct A,,;-linear isomorphisms 
az; 1 Ci @a;, Aig — Cj @a, Ain with a,; @p k = a,j. Let a := (Ci, a;,;). 
Since a @p k = a, the uniqueness statement of Lemma 5.1.1 forces a to be 
a descent datum. By Corollary 1.3.2, a determines a finite Ox-algebra B, 
hence a finite morphism p: Y = Spec B — X lifting p. By construction, p is 
a tame admissible cover. This completes the proof of Theorem 3.1.1 in the 
special case. 


3.2.3 We are now going to poe Theorem 3.1.1 in the general case. For 
n > 0, let R, := R/m"*) and X, := X xp R,. In particular, Xo = oe 
Let p: Y — x be a deformation i: p to R with deformation datum (7,)). 
Think of p as a finite Ox-algebra B with ¥Y = Spec B. The finite Ox,,- algebra 
B, := B@p Rn, corresponds to the deformavion oxrR, of p to Ry with 
deformation datum (7 ey Here Ze ) denotes the image of 7, in Ry. Using 
the special case of Theorem 3.1.1 and Grothendieck’s Existence Theorem 
(Corollary 1.4.2), we see that p is determined by the deformation datum (7;), 
up to unique isomorphism. 

Conversely, let (7;); € T(A) be a deformation datum. Applying Theorem 
3.1.1 in the case of an artinian base ring, ye obtain a compatible system of lifts 
Pn 2 Yn — Xp With deformation datum Ga .. Write ¥,, = Spec Bn, then B’ = 


_ 


(B,)n is a formal finite Oy-algebra. By Grothendieck’s Existence Theorem 
(Corollary 1.4.2) we obtain a finite Ox-algebra B with B, = B@pr R,. Let 
Y := Spec B. All we have to show is that p: Y — X is a tame admissible 
cover. 

Let x € X bea singular point of the special fibre and y € f71(r). Choose a 
formal coordinate system (u,v) for X at x and write A:= Oy, = = R[fu,vlur = 
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t]]. Since p is finite, B := = Oy, is a finite local A-algebra. It follows from 
Proposition 5.2.3 (ili) that Oy,,, A= a ae By the construction of ¥,, 
there exist elements r,,8, € B/m"*!B with r£ = un, 8° = Un, Tr = rise 
R, and B/m™*!B = R,[[ra,3n | PaSn = Tn)]- Moigouer: we may assume that 
(Tn41-3n41) lifts (Tn, Sn). Since Bis complete, we find elements r,s € B with 
image Ty,8, in B/m"*! such that r° = u, s* = v andr c= rs € R. By 
the local criterium of flatness ((17] 20.C Theorem 49), B is a flat R-algebra. 
Hence, by Proposition 2.1.1, we have B = R{[r,s|rs = TJ]. We have shown 
that p is tame admissible at y (Definition 2.3.1 (i)). By similar arguments 
one shows that the same is true if z is a smooth point of X. This completes 
the proof of Theorem 3.1.1. a 


4 ‘Tame fundamental groups of smooth curves 


Let X be a smooth projective curve over a complete local ring R with alge- 
braically closed residue field, and let D C X bea mark. Corollary 3.1.3 states 
that tamely ramified covers of the special fiber (X, D) lift uniquely to tamely 
ramified covers of (X,D). This result was first obtained by A. Grothendieck 
and used to prove his famous theorem stating that the prime to p part of the 
tame fundamental group of a smooth projective curve over an algebraically 
closed field of characteristic p is the same as it would be in characteristic 0. 

So far, [8] is the only complete reference for this result. For the special 
case of étale fundamental groups there is the more accessible account of [20]. 
In both expositions Grothendieck’s Theorem is deduced from facts about 
fundamental groups of rather general schemes. We will see in this section 
that the case of tame covers of smooth projective curves can be handled with 
much less machinery. We are roughly going to prove the following. Let R bea 
mixed characteristic discrete valuation ring with algebraically closed residue 
field of characteristic p > 0 and X a connected smooth projective curve over 
R with a mark DC X. Then there is a surjective specialization morphism 
from the tame fundamental group of the generic geometric fiber (Xx, Dj) of 
to the tame fundamental group of the special fiber (X, D). Moreover, this 
specialization morphism induces an isomorphism on the prime to p parts of 
the tame fundamental groups. 
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4.1 The tame fundamental group as a Galois group 


Let k be an algebraically closed field and X a smooth connected projec- 
tive curve over k. In this case, a mark D C X of X/k is a finite set 
D = {x,,... ,2,} of closed points of X. Tamely ramified covers of (X,D) cor- 
respond to finite tamely ramified extensions of the function field k(.X) of XY. 
Thus we can define the tame fundamental group 7?(X) as the Galois group 
of the maximal algebraic extension of k(X), tamely ramified over 4;,... ,z;. 
This is classical valuation theory for function fields in one variable. Moreover. 
if k is of characteristic 0, r?(X) is the profinite completion of the topological 
fundamental group 7,°P(.Y¢ — {2,... ,2-}). This follows from the Riemann 
Existence Theorem. In this subsection, we list all the facts we need about 
tame fundamental groups of smooth projective curves over an algebraically 


closed field. 


4.1.1 If a € X is a closed point, the local ring Ox.q is a discrete valuation 
ring with quotient field &(X). This induces a bijection between closed points 
of X and discrete valuations of k(X) which are trivial on k. If z € k(X) isa 
local parameter for a, the complete local ring is of the form Ox, = &{[z]]. 

If p: Y — (X,D) € Rer?(X) is a tamely ramified cover (see Definition 
2.3.1) and Y is connected, the function field &(Y’) is a finite extension of k(.X). 
tamely ramified over the valuations corresponding to the points 2,,... ,2,. 
In fact, let y € p~'(z,) and consider the natural extension Ox, —~ Oy,, of 
valuation rings: passing to the complete local rings, we obtain Oy, = A[[t]] = 
Ox.2,{wlw" = =] for a suitable choice of local parameters z and w and with n 
prime to the characteristic of k (see Definition 2.3.1 (1i)). 

Conversely, let L/k{X) be a finite field extension which is tamely ramified 
over Z;,... ,z, and unramified everywhere else. Then £ = k(Y) for a smooth 
connected projective curve Y over k. Moreover, the birational map induced 
by the inclusion k(.¥) — EL = k(Y) extends uniquely to a tamely ramified 
cover p: Y — (.X,D). 

This correspondence between function fields and connected curves carries 
over to non connected curves. Let p : Y — (X,D) be any tamely ramified 
cover and let \j,... ,Y, be the connected components of Y. The function 
ting of Y is defined as the finite k(X)-algebra k(Y’) := k(¥i1) ©... @ &(¥;). 
The extension k(¥")/k(X) is tamely ramified over 21,... ,2, and unramified 
everywhere else. We obtain an equivalence between Rev? (X) and the category 
of finite k(X )-algebras, tamely ramified over z,,... , 2, (from now on we will 
tacitly understand that ‘tamely ramified over z,,... ,2,' implies ‘unramified 
everywhere else ). 


4.1.2 Choose an algebraic closure k{(X) — 9 and let Q C 1 be the 
maximal subextension tamely ramified over z,,...,2,. Then 2/k(X) is a 
Galois extension. Choose a closed point a € X — {z,,... ,z,} and a discrete 
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valuation @ of 2 extending the valuation of &(.’) corresponding to a. Define 
the tame fundamental group of (.X,D) with base point a as the Galois 
group of 2 over k(.X): 


mP(X,a) = Gal(Q/k(X)). (52) 


It may seem strange that t?(X,a) does not depend on a. But the choice of 
a and @ determines the way in which w?(.X,a) classifies all tamely ramified 
covers of (X,D). For our purposes it suffices to consider only Galois covers 
OD (XS), 

Let G be a finite group. A G-Galois cover of (X,D,a) is a tamely 
ramified cover p : Y — (X,D) € Rev?(X) together with an isomorphism 
G — Autx(¥) such that ¥ is connected and G (as automorphism group of 
the cover p) acts transitively and without fixed points on the fiber p (aA 
pointed G-Galois cover of (X,D,a) is a G-Galois cover p: Y — (X.D) 
together with a choice of an element 6 € p~}(a). 

Let p: Y — (X,D) be a G-Galois cover. Then k(Y)/k(X) is a finite field 
extension and G acts as a group of k(X)-automorphisms on k(Y). We can 
identify p~1(a) with the set of valuations of k(Y) extending a. Since G acts 
transitively and fixed point free on this set, k(Y)/k(X) is a Galois extension 
with Galois group G. Moreover, if we choose an element b € p~!(a) there 
is a unique embedding A, : k(¥’) — 2 over k(X) such that Aj1(a) = 6 (as 
valuation of k(Y)). The restriction map on the Galois groups induced by », 
is a surjective continuous homomorphism 7?(X,a) > G. 

Conversely, any surjective and continuous homomorphism 7?(X,a) - G 
corresponds to a subfield £ C 2 which is Galois over k(X) with Galois group 
G. If Y is the smooth projective model of L, ie. L = k(¥’), we obtain a 
G-Galois cover p: Y — (X,D). Note that G acts without fixed points on 
p-*(a) because a is unramified in L = k(¥). Moreover, there is a unique 
distinguished element 5 € p~'(a) corresponding to the restriction of @ on 
L =k(Y). We have proved the following: 


Proposition 4.1.1 The choice of a induces a bijection between isomorphism 
classes ohpoutsed G-Galois covers of (X,D,a) and surjective continuous mor- 
phisms x? (X,a) > G. 


Let g,r > 0. Let [,, be the profinite group with 2g + r generators 
@1,01,..., Gg, bg, Cy,... ,¢, and the single relation 
(Gyyby| oases O/C eae = Y, (53) 
Remark 4.1.2 If the ground field & is of characteristic 0 and the curve X of 
genus g, we obtain an isomorphism 


~ 


Dae syrcendme se | nee a). 
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To prove this fact, one may assume that k is a subfield of C. One can show 
that the natural homomorphism 7?(X¢,a) ~ #?(X,a) is actually an iso- 
morphism, because any G-Galois cover of (Xc, Dc) is already defined over k. 
By the Riemann Existence Theorem, 7? (X¢,a) is isomorphic to the profinite 
completion of the topological fundamental group 7;°°(X¢ — {21,... ,2;},a). 
which is [,,. See e.g. [24] for more details. 


4.2 ‘Tame covers over complete discrete valuation rings 


In this section R will be a complete discrete valuation ring with algebraically 
closed residue field k. We denote by K the quotient field of R and by A an 
algebraic closure of A’. We work out several details about smooth projective 
curves and tamely ramified covers over R which are used in the proof of 
Grothendieck’s Theorem. 


4.2.1 Purity of branch locus First we need some preliminaries about 
regular local rings of dimension 2. See [17], Chapter 17, or [2] VIII §3 for 
more details. 

Let A be a noetherian local domain; denote its residue field by k, its 
maximal ideal by m and its quotient field by A’. The ring A is regular of 
dimension d iff its completion A is. 

From now on, assume A to be regular of dimension 2. Let S,4 be the set 
of discrete valuations v of A’ such that the valuation ring o, contains A. If 
v € S4 then p := AMM, is a prime ideal of height 1. The localization A 
is a discrete valuation ring, hence 0, = A. Regular local rings are normal, 
theretore equal to the intersection of their localizations at primes of height 
one. We see that 


A= [\e (54) 


ves 4 


Let v € S4 and p:= AM m, be as above and choose a € p ~ m’. By [2] VIII, 
5.3 Proposition 2 and Corollaire 1 we conclude that A:= A/aA is regular and 
that (a) is a prime ideal. Since dim A = 2 we have in fact p = (a), and A is 
a discrete valuation ring. Moreover, we have v'(a) = 0 for all v’ € S4 — {v}. 
We will call an element a € A with the above properties a parameter of A 
at v. 

The following lemma is a special case of the Purity Theorem of Nagata 
and Zariski, also called ‘Purity of branch locus’. Its proof is an adaption of a 
proof given in the Appendix of [4]. For much more general versions of Purity, 
see e.g. (8) X.3. 


Lemma 4.2.1 Let A be a noetherian regular local domain of dimension 2. 
the quotient field of A and L/W a finite ertension. Then the integral closure 
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B of Ain L is a finite A-module of rank [ZL : KN}. Moreover, B is étale over 
A if L/Iv is unramified over S4 (the set of valuations dominating A). 


Proof: Let Sg be the set of discrete valuations w of L lying over some 
valuation v € S4. Denote the corresponding valuation ring by O,, and put 


Bon (ex) (55) 


From (54) we can see that A C B. As intersection of integrally closed domains 
of L, B is itself an integrally closed domain of L. Hence, if we can show that 
B is a finite A-module then it will follow that B is the integral closure of 4 
in L. 

Let v € S4 be any valuation of A’ dominating A and choose a parameter 
a for A at v. Since the localization Aj) is a discrete valuation ring, Big) := 
NwivsOw is the integral closure of Ai) in L. By standard valuation theory 
Bia) is a free Aja)-module of rank [L : A]. Therefore Bi.)/aBia) is a free 
Aa)/@Ayq)-module of rank [Z : A). Let A:= A/aA and B:= B/aB. Using 
the fact that v(a) = 0 for all v € Sy — {v,} it is easy to see that the A-algebra 
B := B/aB injects into Biay/aBiay. Since A is a discrete valuation ring, it 
follows that B is a free A-module of rank < [LZ : A]. Nakayama’s Lemma 
implies that B can be generated (as an A-module) by {L: A’] elements. But 
[ is the quotient field of B, therefore B is free over A of rank exactly [L : A). 

Let b,,...,6, be an A-basis of B and 6 be the discriminant of B over A 
with respect to b,,...,0,. Now £/K is unramified over S4 iff v(é) = 0 for 
all v € Sy iff 6 € A” iff B is étale over A (Lemma 5.1.2 (iv)). This completes 
the proof of the lemma. ra 


If L/Jv is tamely ramified in only one place of S4, we can use Abhyankar’s 
Lemma to prove another version of purity. Its formulation is simpler if we 
work with strict complete local rings. 


Lemma 4.2.2 In the situation of Lemma 4.2.1, assume that A is complete 
with algebraically closed residue field. Assume moreover that L/KX is tamely 
ramified over some valuation vg € S4 and unramified over S4 — {vo}. Then 
L/W is purely ramified at vp and the integral closure of A in L is of the form 


B= Alb’ |b =a), 


Here € is the ramification index of vg in L anda € A is a parameter of A at 
Ug. 


Proof: For any natural number ¢ prime to the residue characteristic of vg 
we put A. := Iv [b|b® = a]; this is a finite extension of A’ of degree e, purely 
ramified in vp and unramified over S4 — {vg}. Moreover, A, := Alb | b* = a] 
is the integral closure of A in A: it is a complete regular local domain of 
dimension two (see [2] VIII, 5.4). 
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Let L, := L£- A, (inside some algebraic closure of A’). By Abhyankar’s 
Lemma (see e.g. [20], Appendix to Chapter IX) and the hypothesis we can 
choose € such that £,/A. is unramified at the unique extension of vg to A. 
Hence it is unramified at all discrete valuations dominating A,. Now Lemma 
4.2.1 applies, showing that the integral closure B, of A, in L, is finite étale 
over A,. But A, is strictly henselian, therefore we have B, = A, (Proposition 
5.2.3 (v) ) and ZL C A. In particular, vg is purely ramified in LZ. Hence, if 
we choose € to be its ramification index, then L = A, and B = A,. a 


4.2.2 Specialization Let X be aconnected smooth projective curve over 
R. It is a regular scheme of dimension 2. We will use the notation X := Xxpk 
for the special fiber, Xx := X xpA for the generic fiber and Xz := X xp 
for the geometric generic fiber. Note that X is an integral scheme, because it 
is both regular and connected (use [14] II Proposition 3.1 and the fact that a 
regular connected scheme is irreducible). We will denote its function field by 
K(X). 

Let z be a closed point of the generic fiber X, (which we consider as an 
open subset of XY). The point x corresponds to a discrete valuation of A(X) 
which is trivial on AY. The local ring Ox, is the corresponding valuation 
ring. Let A’ be the residue field of z and FR’ the integral closure of R in A’. 
Then RF’ is a complete discrete valuation ring, purely ramified over R. Since 
X/R is projective, the morphism Spec A’ — X corresponding to z extends 
uniquely to a morphism Spec AR’ — X (({14] I], Th. 4.7 and Th. 4.9). Let z 
be the image of the special point of Spec R’ on X. The point ¢ is the unique 
closed point of the special fiber X contained in the Zariski closure of z on X. 
We will call z the specialization of xr and write x ~ Z. 

Let z € X C X bea closed point of the special fiber. The local ring Ox. + 
is a regular local ring of dimension 2 with quotient field A(X). Let S; be 
the set of discrete valuations of A(X) dominating Ox. A valuation v € S; 
corresponds to a point on .Y of codimension 1 whose Zariski closure contains 
rz. Hence we can identify 5; with the set {cr € Xx | x ~ z} U{n}, where 7 
is the generic point of the unique irreducible component of X containing 7. 
Therefore (54) becomes 


Oxz =(f{ ) Oxz)NOxa. (56) 


Ter 


Let Dc X be a mark on X (see Section 2.2). In particular, D = Spec R’ 
for a finite étale R-algebra R’. But since F is strictly henselian we have R’ = 
R®...G@R. Theretore D is the Zariski closure of a finite set {z,,...,2,} C 
Xp of Av-rational points whose specializations 2,,...,2, € X are pairwise 
distinct. We will identify D with the set {z,,... ,2,} and write X(J‘) for the 
set of /y-rational points of X(/\). 
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Lemma 4.2.3 Let X be a smooth projective curve over R and let D = 
{r1,...,.2,} be a mark on X. 


(1) For every closed point z € X there exists a pointr € X(N) with x ~ z. 


(ii) Let p: ¥Y — (X,D) be a tamely ramified cover anda € X(K’) such that 
its specialization @ is not an element of the reduction {Piss g ae of 
D. Then for any point b € p~‘(a) there is a unique point b € Y(Kh) 
with b ~ b and p(b) =a. 


Proof: Let a € X. By Proposition 2.2.3 we have Ox — R[[=]] for some 
local coordinate z. Hence we can embed K(X) (which is the quotient field of 
Ox a) into A’((z)). This defines a discrete valuation v, with residue field Ky 
on A(X). It is clear that the point z € X(J’) corresponding to vz specializes 
to z. This proves the first claim. In the situation of (ii), p is étale at 5. 
Therefore Ox, — Oy5, proving the second claim. a 


4.2.3 Reduction Now we assume in addition that the geometric fibers 
Xx and X are connected. For any finite extension A’/K, let R’ be the 
integral closure of R in A’. Then X':= X xp RF’ is a smooth projective curve 
over Ff’. The scheme X’ is still connected, has special fiber X and function 


field A'(X") = K(X) @x KB" 


Proposition 4.2.4 Let pp : Yr — (Xx, Dr) be a tamely ramified G-Galois 
cover. Assume that the order of G is prime to the characteristic of k. Then 
there exists a tamely ramified cover p: Y — (X,D) with pp =p xp. Its 
reduction p: Y —(X,D) is a G-Galois cover. 


Proof: By assumption Yj is connected. The G-Galois cover pr is already 
defined over some finite extension A’ of A, i.e. there is a tamely ramified 
G-Galois cover px : Ye — (Xx, Dx) with pre = per xx: A. Note that 
the corresponding extension of function fields A’(¥x1)/K'( Xx) is a G-Galois 
extension. Our assumption on G allows us to apply Abhyankar’s Lemma. 
More precisely, after a tamely ramified extension of A’’ we may assume that 
the discrete valuation of A’(.X’) corresponding to the generic point of X is 
unramified in A’(¥x:). 

Let Y’ be the normalization of X‘in A’(Y-) and p': Y' + X’ the natural 
map ({14] If Ex. 3.8). By construction Y’ is a normal integral scheme with 
function field A’(¥’) = A'(Y_-). We claim that p’ is a tamely ramified cover 
of LX 2D 

Let zt € X bea closed point and let B be the integral closure of Ox: 
in A'(Yp-). By Lemma 4.2.1 B is finite over Oxrg. Therefore the fiber 
(p')-*(z) = {t1,... Ys} is a finite set and B is its semi-local ring (i.e. its 
localizations are exactly Oy.,y,). In particular, p’ is a finite morphism. 
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Let S; := {x € Xx | x ~ F}U {n} be the set of discrete valuations 
of A(X) specializing to @. If  ¢ {#....,2,} then A’(Yq:)/K'(Xp-) is 
unramified over Sz. If t = z; then it is tamely ramified over z; and unramified 
over Sz — {xz;}. Using Lemma 4.2.1 and Lemma 4.2.2, we conclude that p’ is 
a tamely ramified cover of (.X’, D’). 

Let p := p'x pk be the reduction to the special fiber: it is a tamely ramified 
cover of (X,D). By Corollary 3.1.3 it lifts uniquely to a tamely ramified cover 
p:¥ — (X,D). The uniqueness of lifting implies p’ = p xp R’. Therefore 
PK =pPXR XK. 

Note that there is a natural G-action on Y’ inducing a G-action on Y and 
hence on Y. To prove that p: ¥ + X is in fact a G-Galois cover, it remains 
to show that Y is connected. Assume that Y is the disjoint union of two open 
subset ¥,,¥. Then the natural maps ¥; — (X, D) are still tamely ramified 
covers, for2 = 1,2. Lift them to tamely ramified covers ¥; — (X,D),7 = 1,2. 
By the uniqueness of lifting, we conclude that Y is the disjoint union of Y; 
and ¥2. But Y is an integral normal scheme and hence connected ({14] II 
Proposition 3.1). Therefore one of the ¥; must be empty, proving that Y is 
connected. a 


4.3. The specialization morphism 


Let G be a profinite group. We define the prime to p part of G as the 
inverse limit over those finite quotients of G which are of order prime to p and 
denote it by G®. It is a profinite quotient of G. If 6: G = H is a continuous 
morphisms of profinite groups, @ factors to a continuous morphism o” : G?’ — 
H”’. Note that surjectivity of 6 implies surjectivity of o?’. Remember that 
I’, denotes the tame fundamental group of a smooth projective curve of 
genus g with r marked points in characteristic zero (Remark 4.1.2). 


Theorem 4.3.1 (Grothendieck) Let X be a smooth projective curve of 
genus g over an algebraically closed field k of characteristic p > 0 and 
dg, 21,... ,2, be pairwise distinct closed points. Let D := {z,,...,2,}. Then 
there is a surjective homomorphism of profinite groups 


LD oa P(X, ao) 
inducing an isomorphism on the prime to p parts, 1.e. hae — 7? (X.a9)” 
The proof of this theorem will be given in the rest of this section. In 
4.3.1 we construct a lift of X to characteristic 0. In 4.3.2 we construct the 
specialization morphism between tame fundamental] groups. In 4.3.3 it is 
shown that this specialization morphism is injective on the prime to p: parts. 


4.3.1 Let X be a smooth projective curve over an algebraically closed 
field of characteristic p > 0 and let D = {z,,... ,z,} be a mark on X. Let 
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R := W(k) be the ring of Witt vectors over k: it is a complete discrete 
valuation ring of characteristic 0. We adopt the notations A’, A’ etc. from 
Section 4.2. By Proposition 4.3.2 below we can lift Y to a smooth projective 
curve .X over A: its geometric generic fiber X¢ is connected. By Lemma 
4.2.3 we can lift %,,...,z2, € NX to A-rational points ry,..., z, © X(R), 
This-deninés:-a: mark) t= {2}40. 42,)-0n X. 


Proposition 4.3.2 Let X be a smooth projective curve over k. 
(i) There exists a smooth projective curve X over R with X =X xp k. 


(ii) If X is connected, then for any X as in (i) the generic geometric fiber 
Xp t= X XR WV is connected. 


Proof: We will give a proof of (i) only in characteristic different from 2. For 
the general case, see e.g. [8] III. If char(k) # 2 then there exists a simple 
morphism f : X — Pi (see [5] Proposition 8.1). In particular, f is tamely 
ramified, having only ramification of order 2. Therefore we can lift f to a 
tamely ramified cover f : X — Ph (Corollary 3.1.3). This proves (i) in this 
special case. 

Assume that X is connected and that X is a lift of X as in (i). Let 
R be the integral closure of R in A. Then Xp := X xp R is a smooth 
projective curve over the valuation ring R with special fiber X. Since the 
natural morphism Xp — Spec F is closed ([14] II Theorem 4.9), every closed 
subset of Xp must intersect the special fiber X. Therefore Xp is connected. 
A connected regular scheme is integral. Therefore X; C Xp is integral and 
in particular connected. | 


4.3.2 Let (X, D) be as constructed in 4.3.1. Choose a closed point do € 
X — {2,,...,2,}. By Lemma 4.2.3 (i) we can choose a point a, € X(fv) - 
{r,,...,2,} which specializes to do. We will regard ag (resp. a,) as base 
points on X (resp. Xx). As in 4.1.2, choose an algebraic closure k(X) — Mo 
and let Q9 be the maximal subextension of Ng which is tamely ramified over 
I1,...,2,. Extend the valuation on k(X) corresponding to dg to a valuation 
Gy on Qo. Similarly for Xx: let Q, be an algebraic closure of A(X), Q, the 
maximal subextension tamely ramified over z;,... ,z, and @ a valuation of 
(2; lying over a,. As in 4.1.2, we define the fundamental groups 


me ® (Xp, a1) = Gal(Q/K(X)), 2?(X,ao) = Gal(Q/k(X)) (37) 


We can interpret Lemma 4.2.3 as follows. The process of specialization a, ~~ 
dg is a ‘path’ in X — D connecting ag with a,, and tamely ramified covers 
of (.X, D) have a unique lifting property with respect to such paths. In close 
analogy to tgpological ne space theory, this lifting property defines a 
morphism 7; “(X,,a,) —+ 7?(X,ao) of fundamental groups. 
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Proposition 4.3.3 There exists a natural surjective morphism of profinite 
groups 


O:mP*(Xp,a1) —+ P(X. ao). 


Proof: Let G = 7?(X,ao)/H be a finite quotient. By Proposition 4.1.1 it 
corresponds to a pointed G-Galois cover p : ¥ + (X,D). Let bp € p-+(ao) 
be the distinguished point. 

Corollary 3.1.3 states that we can lift f to a tamely ramified cover p : 
Y — (X,D) with automorphism group G. In particular, G acts on Yj. 
Lemma 4.2.3 (i1) shows that specialization defines a G-equivariant bijection 
py (41) — p~'(ag). Hence G acts transitively and fixed point free on py (41). 
By Proposition 4.3.2 (11) ¥x is connected. Hence pg : Yr — Xx is a G-Galois 
cover. Let b; € pz-(a1) be the unique lift of bp over a, (Lemma 4.2.3 (ii) ). 
Consider pg as a pointed G-Galois cover with distinguished point b;. By 
Proposition 4.1.1, pz corresponds to a surjective continuous homomorphism 
OH: Ty K (Xr, ay) —G. 

We claim that the maps oy are compatible (where H runs over all normal 
subgroups of finite index) and lift to the desired surjective morphism 6. To 
prove this we have to show the following: if G’ = t?(X,dq)/H’ is a smaller 
quotient than G, ie. H C H’, and y : G — G’ is the natural map, then 
{0 On = On. : 

Let p’ : Y’ — X be the pointed G’-Galois cover corresponding to the 
natural map 7?(X,a9) — G’ (with distinguished point 05). Let p’: Y’ — X 
be its lift to R and b, be the lift of bg in (p’)~*(a,). Let Ay : K(¥') — 
Q, be the unique embedding such that \;,'(a) is the valuation on K(¥’) 
corresponding to the point ;. The map On is the restriction map of Galois 
groups corresponding to A». 

By the definition of G and G’ we have a natural inclusion &(¥’) C 
KY) C Qo of Galois extensions, such that the restriction map from G = 
Gal(h(Y)/k(. X)) to G’ = Gal(k(¥")/k(X)) is the natural map. It induces 
an X-morphism f : ¥ — Y’ such that f(b) = 8). By Corollary 3.1.3 f 
lifts uniquely to an X-morphism f : ¥Y — Y’. The uniqueness statement of 
Lemma 4.2.3 (ii) implies that f(b,) = 04. Let x: A(Y’) ~ K(Y) be the 
inclusion of function fields induced by f. We have 


Aye = As, © Hs (58) 


because both embeddings restrict a, to the valuation corresponding to 04. If 
we translate (58) into the corresponding equation for maps between Galois 
groups, we obtain 7) o @y = @y. This proves the proposition. | 


4.3.3 We are now going to finish the proof of Theorem 4.3.1. Remember 
that AY is an algebraically closed field of characteristic 0. By Remark 4.1.2 
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we can identify aPR (e+4,) with [,,. Thus, the proof of Theorem 4.3.1 is 
complete if we are able to show that the morphism © given in Proposition 
4.3.3 is injective on the prime to p parts. 

Reconsidering the construction of the specialization morphism, we see that 
we have to prove the following. Let G be a finite quotient of 7; *(Xx.a,) of 
order prime to p. Then the natural map ¢ : aR (Xe, ay) — G factors over 
O. 


¢ corresponds to a punctured G-Galois cover py : ¥x — Xx (with dis- 
tinguished point b, € p7'(ai). By Proposition 4.2.4 we can find a tamely 
ramified cover p: Y —- X with pe =p xr. Its reduction p:= pxprk is 


a G-Galois cover, which we consider as punctured by the specialization bg of 
b,;. Therefore p corresponds to a surjective map y : t?(X,a9) — G. By the 
construction of @ we have 7: = yoo. al 


5 Appendix 


For the convenience of the reader, this appendix contains some results about 
étale ring extensions and henselian rings which are used in this paper. Our 
main reference is [21]. See also [18] Chapter I, §3. 


Bel Let A,B be rings, ¢ : A — Ba morphism and q € SpecB. A — 
B is called unramified at q, if pB = gq and the residue field extension 
A/pA — B/qB is separable (where p := yo4(q) € Spec A). We say 
that B/A is unramified, if it is unramified for all q € Spec B. Assume for 
a moment that B is finitely generated as an A-algebra. Then A — B is 
unramified at q iff A — B is ‘net’ at q (in Raynaud’s terminology, see [21] 
I, Def. 4) iff (Q8/4) = 0 (see (21] III, §4, Prop. 9 u. Ex. 1). Since Qgya is a 
finitely generated B-module, being unramified is an open property on Spec B 
(supp 28/4 C Spec B is closed). 

A -— B is called étale, if it is of finite presentation, flat and unramified. 
A — B is called formally étale if for every A-algebra C and ideal Id C 
with J* = 0 the canonical map 


Hom,(B,C) — Homa(B,C/I) (59) 


is a bijection. By [21], Chapitre I, Definition 2 and Chapitre V, Corollaire 1. 
A — B is étale if and only if it is of finite presentation and formally étale. 


Lemma 5.1.1 Let A be a ring with a nilpotent ideal I; let A:=A/I. Then 
every finite étale A-algebra B lifts uniquely to a finite étale A-algebra B. 


Proof: This follows from [21] V, Thm. 4, and from Nakayama’s Lemma. 


Lemma 5.1.2 Let A be a ring and B,C be A-algebras. 
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(i) IfA— B is étale, then so is C — B@4C. 


(11) Let B — C be an A-algebra morphism. If A— B and A —C are étale, 
then B— C is étale. 


(ii) Jf A— B and B —C are étale, then A—C is étale. 


(iv) Assume that B is a finite free A-algebra and let (6g/4) 4 A be the dis- 
criminant ideal. Then A — B is étale if and only if dp;4 € A*. 


Proof: As remarked above, a ring extension is étale if and only if it is formally 
étale and of finite presentation. This implies the Claims (i), (ii) and (iii), 
because the analogous statements hold both for formally étale and for finitely 
presented morphisms. To prove (iv), note that A — B is étale if and only if 
it is unramified. By [21] III Proposition 10 and 11, A — B is unramified if 
and only if B@,4(A /pA ) is a separable A /pA -algebra, for all p € Spec A. 
It is well known that the latter holds if and only if 65/4 ¢ p (see e.g. [18] III 
Proposition 3.1). i 


5.2 Let A be a ring and A — k a morphism to some field k. Denote by 
Ei(A — k) the category of étale A-algebras B equipped with an A-algebra 
morphism B — k. Morphisms between objects B — k and C — k are 
A-algebra morphisms B — C’ compatible with the maps to k. If such a 
morphism exists, then we say that C — k is smaller than B — k. A ring is 
called indecomposable if it is not the direct sum of two rings (equivalently, 
Spec A is connected, {14] II Ex. 2.19). Let Ei’(A — k) be the full subcategory 
of indecomposable objects of Ei(A — k). 


Lemma 5.2.1 Let A — k be as above and B—k, C +k e€ H(A — &). 
If C is indecomposable, then there is at most one morphism from B — k to 
C —k. In any case, there exists an indecomposable D — k € Et'(A — k) 
smaller than B — k andC — k. 

Proof: See {21] VIII §2, Prop. 2. al 


The Lemmas 5.1.2 and 3.2.1 allow us to define the direct limit 


? 


A:=lim A’, Al— ke EA &). (60) 


The ring A is called the henselization of A with respect to A — k. There 
is a natural A-algebra morphism A — k. If k& is separably closed, then A is 
called a strict henselization of A at Ker(A — k). 

A local ring A is called henselian, if every finite A-algebra is the direct 
sum of local factors (see [21], I, Def. 1). Or, equivalently, if A verifies Hensel's 
Lemma ((21] I, Prop. 5). The ring A is called strictly henselian if it is 
henselian and its residue field is separably closed. From [21] Chap. VII. we 
get 
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Remark 5.2.2 A is a local henselian ring and its residue field A/m is the 
separable closure of A/p inside k. A — A is flat and unramified. If A is 
noetherian, A will be noetherian, too. A — k satisfies the following universal 
property: for every henselian local A-algebra B, equipped with an A-algebra 
morphism B — k, and whose residue field contains the separable closure of 
A/p inside , there is a unique A-algebra morphism A — B compatible with 
the maps to k. 


We define the completion A of 4 with respect to A — kas the completion 
of A at its maximal ideal. If & is separably closed, A is called the strict 
completion of A at Ker(A — k). 


Proposition 5.2.3 Let A be a noetherian ring, A— B be a finitely presented 
morphism and B — k a morphism to a field k. Then: 


(i) There are natural local morphisms A — B (resp. A> B) between the 
henselizations (resp. completions) with respect to k. 


(ii) If A— B is flat, AS B and A— B are faithfully flat. 
(iii) If B = A/I for some ideal I d A, then B= A/IA and B= A/IA. 


(iv) A — B is unramified at q := ker(B — k) iff A — B is surjective iff 
A— B is surjective. 


(v) A — B is étale at q iff A — B is an isomorphism iff A —~ B is an 
isomorphism. 


(vi) A is integrally closed in A. 


Proof: 

(i): A — B exists by the universal property of henselization (see Remark 
3.2.2). The existence of A — B is a direct consequence. 

(ii): If A — B is flat, its base change A — B®, A is flat, too. But 
B @, A is the limit of she B@sA SkKEH(Bs k), where A’ —- k runs 
over Ei’(A — hk). Conclude with [21] VIII, §2, Prop. 5 that B is also the 
henselization of B@4 A w.r.t. its natural morphea to k, i.e. Ba A— Bis 
flat. Now transitivity of flatness and {17] 4.4 shows that A — B is faithfully 
flat. As A is noetherian, all other rings we use are noetherian, too, and 
A ad A, BoB are faithfully flat ({17] 23.L Cor. 1). Look at the factorization 
A+A® @j B — B. The first arrow is faithfully flat by the base change rule, 
the second one is completion at the maximal ideal of A®; B, hence faithfully 
flat, too. Again, transitivity shows that A — B is faithfully flat. 

(ili): We have a natural morphism A/IA — B. Since A is henselian 
and A — A/IA a finite local morphism, A/IA must be henselian. By the 
universal property of henselization, there is a morphism Be A/IA, which 
is easily seen to be the inverse of the morphism of the first sentence. 
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(iv): If A — B is unramified at q, then it is net in a neighborhood of 
q. By [21] V, §1 Thm. 1, we can find A’ — hk € Et'(A 4k) and B’ ke 
E1'(B — k) with B’ = ANT! Now (iii) implies B = A/TA. The surjectivity 
of A+B immediately implies the surjectivity of A — B. Now assume the 
latter. Then A — A and A — B are unramified, hence there composition is 
unramified, too. Thus, A /p — B/q— B/ing i is separable and pB = tmp 
(where m4, Mg are the maximal ideals of A, B and p := ker(A — k)). Now 
the faithful flatness of B — Band iy qB implies pB, = q, hence A — B 
is unramified at q. 

(v): If A — B is étale at (and then also around) q, the definition of the 
henselization immediately shows that A = B, which trivially implies A&B. 
Assuming the latter, we conclude as in the proof of (iv) that A — B is 
unramified at q. Now the faithful flatness of A — A and B — B and [17] 
4.B show that A — B is flat, therefore A — B is étale in a neighborhood 
of q. 

(vi): Let a € A be an element which is integral over A. Then A’: 
Ala] C A is a finite local extension of A. But the completion of A and A’ is A 
for both rings. By (v), A A’ must be étale. Therefore A’ = A anda € A. 

o 
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